(54) Apoptosis-regulating proteins

(57) Adenovirus E1B 19 kD protein protects against cell death induced by viral infection and certain external physical and chemical stimuli. Activity of the 19 kD protein is similar to the cell death suppressing activity of the protein coded by the Bcl-2 protooncogene. Bcl-2 protein can functionally substitute for the E1B 19 kD protein during adenovirus infection and in transformation of primary cells with adenovirus E1A. Five different cDNAs for proteins, designated Nip1, Nip2, Nip3, Bip1A and Bip13, that specifically interact with the 19 kD protein were found. Mutational analysis of the interaction indicates that at least four of the proteins (Nip1, Nip2, Nip3 and Bip1A) associate with the 19 kD protein at specific sites thereof. Homologous motifs are found on Bcl-2. An additional protein (Bip5) interacts with Bcl-2 but not with the 19K protein.
Description



... The onset of apoptosis arises from an acti... in cells infected with E1B 19K ... 1992).

19K can efficiently suppress ... (... et al., 1993; ... 1993). It appears that 19K suppresses ... (Subramanian et al., 1993; Tarodi et al., ...).

... et al., 1988; Sentman et al., 1991; ...) ... treatment with glucocorticoids (Alnemri et al., ...) ... inhibits apoptosis induced by ... 1990; Hockenbery et al. ...) and other DNA damaging agents (Tarodi et al., ...).

... the ... protein can substitute for the 19 kD protein during ... of DNA induced ... with Ad2 19K mutants is ... BHRF-1, bcl-X, ...

Another object of the instant invention is to provide ...

In another embodiment the present invention ... regulating proteins ... molecules [which] hybridize to the above-mentioned nucleic acid sequences ... site of a molecule which regulates cell ... is under conventional hybridization conditions, preferably under stringent conditions.
... of the present invention can either be DNA or RNA molecules. The invention also ... anti-sense RNAs or specific ribozymes which allow the ... invention in the host and cells ... RNAs ... ribozymes are also contemplated by the present invention. Furthermore, in another embodiment the invention relates to primers which allow the specific amplification of nucleic acid molecules of the present invention or of any specific parts thereof. In another embodiment, the invention relates to probes that specifically hybridise to nucleic acid sequences of the present invention or to any part thereof.

In a further embodiment, the present invention relates to vectors comprising said DNA molecule of the present invention. Such vectors are preferably plasmids, cosmids or phage vectors.

The promoter is selected so as to allow expression of ...

In another embodiment, the present invention relates to host cells which contain the vector of the present invention. ... Escherichia ... Examples of preferred eukaryotic cells are yeast or animal cells ...

In another embodiment, the present invention relates to pharmaceutical and diagnostic compositions comprising the nucleic acid molecules, the polypeptides or the oligopeptides of the present invention.

... which bind to the E1B 19K ... of consensus sequences shared by the proteins which mediate the binding ...

Figure 1 depicts the functional organization of the 19 kD protein. The 19K sequences involved in suppression of ...-induced cell death and E1A-cooperative transformation are based on Subramanian et al. (1993) ... of the figure. Solid line bars indicate strong effects of ... with those sequences (... in boxes) are shown at the bottom of the figure (residues ...; residues 90-98, SEQ ID ...; residues 106-112, SEQ ID ...; and residues ..., SEQ ...). The human Bcl-2 protein is a ... form isolated from a lymphoma and ... (Seto et al., 1988; Hockenbery et al., 1990) and ... indicated by the lines ... amino acids ...

Figure 2 ... sequence homologies. Figure 2A: Homology between Nip1 (amino acids 43-100 of SEQ ...) and ... of calmodulin-dependent phosphodiesterase (PDE) (SEQ ID NO ...). ... The underlined residues are ... conform to the consensus. ... RhoGAP (SEQ ID NO:35) (Barfod et al., 1993; Lancaster et al., 1994).

... to include modifications to the molecules of interest ... fusion ... and the like which brings about a physical change in the ... but does not alter substantially the biologic function thereof in relationship to the ... so long as a desirable and tolerable level ... characteristic is enhanced, for example, solubility in an ... medium ... may yield activity ... circumstances, is not a sine qua non condition in the instant invention. A key determinant, thus, is not the ... is the guiding directive as to the molecules of the instant invention. Accordingly, by way of example, any one or all ... level or ... degree is mandated by the design.
... regulate ... cell survival ... hereinabove, there are molecules which have the effect of prolonging cell survival. Examples are Bcl-2 ... of proteins to subvert apoptosis and prolong cell survival. To prolong cell survival is to generate cells that statistically survive longer than expected.

A ... genetic strategy designated ... cloning (Fields and Song, 1989; Chien et al., 1991) permits ... molecules that associate with the ... has been used to isolate from expression libraries cDNA clones that code for proteins interacting with several known proteins.

The method involves transformation of yeast hosts with plasmids that encode fusion proteins.

One plasmid carries partial sequences for a reporter molecule, for example, the GAL4 DNA binding domain, at the ... terminus and the molecule for which a ligand is sought, known as the bait, at the carboxyl terminus. For example, the bait can be the 19K protein or Bcl-2 protein.

The second plasmid comprises sequences encoding a complementary protein for the reporter molecule, in the above case, required by the GAL4 DNA binding domain, such as the GAL4 activation domain, at the ... terminus and a cDNA ... at the carboxyl terminus. A suitable host is used to ...

The desired transformants are those that express β-galactosidase, hence would be blue colonies on an X-gal ... the two GAL4 regulatory elements required for β-galactosidase expression.
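The reporter logic of the two hybrid method described above can be sketched as a small boolean model. This is only an illustrative sketch, not part of the disclosed method: the `Fusion` class, the `reporter_on` and `colony_colour` functions and the set of known interactions are hypothetical names introduced here, and the model assumes that the reporter is expressed only when both GAL4 domains are present and the fused proteins bind each other.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Fusion:
    """A GAL4 domain fused to a protein of interest (hypothetical model)."""
    gal4_domain: str  # "DNA-binding" or "activation"
    partner: str      # name of the fused protein, e.g. the bait or a cDNA product


def reporter_on(bait: Fusion, prey: Fusion,
                interactions: set[frozenset[str]]) -> bool:
    """Return True when GAL4 function is reconstituted.

    Both GAL4 domains must be supplied, one by each fusion, and the two
    fused proteins must be listed as an interacting pair.
    """
    domains_complete = {bait.gal4_domain, prey.gal4_domain} == {
        "DNA-binding", "activation"}
    partners_bind = frozenset({bait.partner, prey.partner}) in interactions
    return domains_complete and partners_bind


def colony_colour(bait: Fusion, prey: Fusion,
                  interactions: set[frozenset[str]]) -> str:
    """Blue colonies express beta-galactosidase on X-gal; white ones do not."""
    return "blue" if reporter_on(bait, prey, interactions) else "white"


# Assumed example data: only the 19K/Nip1 pair is declared as interacting.
known = {frozenset({"19K", "Nip1"})}
bait_19k = Fusion("DNA-binding", "19K")
prey_nip1 = Fusion("activation", "Nip1")
prey_lamin = Fusion("activation", "lamin")

print(colony_colour(bait_19k, prey_nip1, known))
print(colony_colour(bait_19k, prey_lamin, known))
```

The model deliberately ignores expression levels and indirect interactions; it only captures why a blue colony is taken as evidence that the cDNA product binds the bait.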

... cDNAs encode polypeptides that bind 19K, Bcl-2 or both ... role in apoptosis whether by functionally and directly ... one or more ... of apoptosis ... affecting the survival enhancing activity of 19K or Bcl-2.

In biologic systems, because of the tertiary and quaternary configuration of molecules and the ... a molecule can have ... such as a recognition function, a signalling function, a catalytic function, a binding function and the like. For example, a specific antigenic determinant ... plurality of short amino acid stretches in the heavy and light chains that vary from ...

Similarly, the binding interaction of an apoptosis-regulating protein, such as 19K and bcl-2, with another apoptosis-regulating molecule, such as Nip1, Nip2, Nip3 and Bip1A, disclosed herein, ... governed and ... by certain sites of the proteins which are contiguous portions of the proteins which comprise short conserved motifs or consensus sequences of amino acids. The motifs can comprise 15 amino acids or less, often 10 amino acids or less. Two motifs are AAPAPGFFS (SEQ ID NO:1) and YRRDFAE (SEQ ID NO:2) of Bcl-2 ... (SEQ ID NO:5), where b is a basic amino acid, x is any amino acid and ... is ... (SEQ ID NO:6), where H is a hydrophobic amino acid, ... be important ... AAPAPGIFS (SEQ ID NO:7) (Cleary, 1986). On the other hand, the motif obtained from a follicular lymphoma-derived clone of bcl-2 was found to be AAPAPGFFS (SEQ ID NO:1). Also, the bcl-2 consensus may evince some variability, having a sequence YRGDFAE (SEQ ID NO:32) (Tsujimoto & Croce, 1986).
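The motif variants quoted above differ by single amino acid substitutions, which can be located with a position-by-position comparison. The following is a minimal sketch; the function name `substitutions` is introduced here for illustration only, and positions are counted from 1 within the motif, not within the full protein.

```python
def substitutions(a: str, b: str) -> list[tuple[int, str, str]]:
    """List (position, residue_in_a, residue_in_b) where two motifs differ.

    Positions are 1-based within the motif. The motifs must be the same
    length, since this is a simple ungapped comparison.
    """
    if len(a) != len(b):
        raise ValueError("motifs must have equal length")
    return [(i, x, y)
            for i, (x, y) in enumerate(zip(a, b), start=1)
            if x != y]


# Motif sequences as quoted in the text (SEQ ID NO:1 vs NO:7, NO:2 vs NO:32).
print(substitutions("AAPAPGFFS", "AAPAPGIFS"))
print(substitutions("YRRDFAE", "YRGDFAE"))
```

Each pair differs at exactly one position (F to I at motif position 7, and R to G at motif position 3), which is the kind of substitution the text proposes as a differential marker.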

... the importance of the consensus sequences in the normal state and provides a differential marker for identifying and screening abnormal cells based on the amino acid substitution.

The instant nucleic acids and polypeptides may be obtained as described herein, that is by recombinant means or ... nucleic acids and polypeptides may be synthesized in vitro using, for example, solid phase oligonucleotide and oligopeptide synthetic methods known in the art.

The interaction between apoptosis-regulating proteins, such as 19K and Bcl-2, and the proteins of the instant invention that bind thereto may take a variety of forms. For example, the molecules may either promote or suppress ... 19K and Bcl-2 type proteins may recruit the proteins of the instant invention and ... cell survival by enhancing the activity thereof or by suppressing the activity thereof.

The availability of ready sources of apoptosis-regulating proteins enables the manipulation of cells for a desired ... For example, it is believed that certain malignancies arise from over-expression of an apoptosis-blocking gene such as bcl-2. In that case, application or administration of an instant protein, or nucleic acid encoding said protein ... Bcl-2. In the converse, ... of a protein such as Bcl-2 ... enhance the life span of a cell.
... transformation methods ... be transfected by an expressible ... encoding the desired protein ... vector can be ... host genome ...

... degenerative diseases ... a specific reporter ...

Such an assay, which turns on the ... association between an apoptosis-... molecules, ... in identifying drugs that ... activity. Hence, the amount of ... presence and amount ... can be ascertained.

As to antibodies, ... polyclonal or monoclonal ... of a target molecule can be determined ...

The invention now will be described by reference to ...

EXAMPLE 1

... sites in the case of Bcl-2).
Plasmids expressing the 19 kD bait and a human cDNA expression library designated pACT (Durfee et al., 1993) tagged with the GAL4 activation domain (Durfee et al., 1993) were used for simultaneous transformation of yeast indicator strains, GGY1::171 (Gill and Ptashne, 1987) or Y153 (Durfee et al., 1993), by the lithium acetate method of Schiestl & Gietz (1989). Functional reconstitution of the GAL4 transactivation function through interaction between the 19 kD moiety of the GAL4(1-147)-19K fusion protein and the cellular protein tagged with the activation domain of GAL4 activates expression of the lacZ reporter gene in strain GGY1::171 and both the HIS3 and lacZ genes in strain Y153.

Transfected cells were plated on synthetic dropout medium (SD) (Rose et al., 1990) lacking leucine and histidine (strain GGY1::171) or lacking histidine, tryptophan and leucine (strain Y153) and including 12.5 to 25 mM 3-AT (3-aminotriazole), a chemical which restores histidine auxotrophy (Kishore and Shaw, 1988). After three (GGY1::171) to five (Y153) days at 30°C, colonies were lifted onto nitrocellulose filters (Schleicher and Schuell, BA85) and made permeable by freezing the filters for 5-10 sec on aluminum foil floats placed over liquid nitrogen.

Filters were overlaid on Whatman filters saturated with Z buffer (Miller, 1972) containing 1 mg/ml X-Gal and incubated at 30°C (Breeden and Nasmyth, 1985). Colonies which turned blue within a period of 1-6 hr were picked and patched onto fresh selective plates.

The cells then were diluted in TE (10 mM Tris, pH 8.0 and 1 mM EDTA), plated onto selective plates and well-isolated colonies which were positive in the X-Gal assay were selected. Cells from the isolated colonies were patched onto selective plates and were then grown in selective liquid SD media. After overnight growth, the plasmid DNAs were extracted from the cells using glass beads (Hoffman and Winston, 1987) and the recovered DNA was used to transform an E. coli leuB strain (obtained from Stanley Fields, S.U.N.Y. Stony Brook) by electroporation using a BRL Cellporator according to the specifications of the manufacturer.

Ampicillin-resistant colonies were selected and replica plated on minimal M9 plates lacking amino acids to eliminate the plasmid expressing the bait protein. Plasmid DNA (containing the LEU marker) from colonies which grew on M9 media was prepared by alkaline lysis. The interaction between the 19 kD protein and the proteins coded by the cDNA clones was further ascertained by a second round of yeast transformation and X-gal staining. The cDNA clones which were positive with pMA-19K or pAS-19K and not with pMA424 or pAS1 vectors were selected for further analysis. The interaction of the selected cDNA clones with various heterologous protein baits then was determined to further ascertain the specificity of interaction with the 19 kD bait.

About 10^5 transformants of each strain were screened for the activation of the reporter genes and three strongly positive clones were chosen for further analysis. The clones were tested in the two hybrid screen with either the 19K bait or with the GAL4 DNA binding domain vectors, pMA424 (Ma & Ptashne, 1987) or pAS1 (Durfee et al., 1993). All three clones (22, 44 and 91) reacted positively only with the 19K bait and not with the respective vectors.

The three clones (22, 44 and 91) were found to be defective in interaction with 19K mutant baits of interest. 

EXAMPLE 2 

To determine the specificity of the two hybrid interaction, the three clones, 22, 44 and 91, were tested against baits expressing several heterologous proteins (Table 1). Among the various heterologous protein baits used, the adenovirus E3 11.6 kD protein is of particular interest as that protein localizes on the nuclear envelope region and has an internal hydrophobic domain like the E1B 19 kD protein (Scaria et al., 1992). All three clones were negative against all the heterologous protein baits tested, indicating that the proteins coded by the clones specifically interact with the 19 kD protein.
Table 1

Interaction of cellular apoptosis-regulating proteins with heterologous protein baits@

                lacZ expression (X-gal)
Bait            clone 22    clone 44    clone 91
19 kD           B           B           B
HIV-1 Rev       W           W           W
HIV-1 Tat       W           W           W
HIV-1 Nef       W           W           W
HTLV-I Rex      W           W           W
SNF1            ND          W*          W*
E1A-Exon2       W           W           W
h-lamin         W           W           W
E3-11.6 kD      W           W           W

The heterologous baits pLAM5 (human lamin) and pAS-SNF1 (yeast SNF1) have been described (Bartel et al., 1993; Durfee et al., 1993). Baits pHIV-1 Rev, pHIV-1 Tat, pHIV-1 Nef and pHTLV-1 Rex expressing the various HIV or HTLV regulating proteins were constructed as described herein and obtained from Cleta D'Sa-Eipper and T. Subramanian, St. Louis University Medical Center, St. Louis, MO.

@ Interactions with all baits except the yeast SNF1 protein were carried out in yeast strain GGY1::171. Interaction with SNF1 was carried out in yeast strain Y153.

* Colonies which appeared on -his, -trp, -leu plates after 6-8 days.

Results of X-gal staining are indicated as B, blue; W, white; or ND, not done.
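The specificity argument drawn from Table 1 can be expressed as a simple check: a clone counts as specific when it is blue with the 19 kD bait and not blue with any heterologous bait. The sketch below transcribes the X-gal results from Table 1; the names `TABLE_1` and `is_specific` are introduced here for illustration, and treating "ND" (not done) as non-contradicting is an assumption of this sketch rather than a statement from the text.

```python
# X-gal results transcribed from Table 1 (B = blue, W = white, ND = not done).
TABLE_1 = {
    "19 kD":      {"clone 22": "B",  "clone 44": "B", "clone 91": "B"},
    "HIV-1 Rev":  {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
    "HIV-1 Tat":  {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
    "HIV-1 Nef":  {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
    "HTLV-I Rex": {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
    "SNF1":       {"clone 22": "ND", "clone 44": "W", "clone 91": "W"},
    "E1A-Exon2":  {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
    "h-lamin":    {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
    "E3-11.6 kD": {"clone 22": "W",  "clone 44": "W", "clone 91": "W"},
}


def is_specific(clone: str, table: dict, target: str = "19 kD") -> bool:
    """True if the clone is blue with the target bait and never blue otherwise.

    Assumption: an untested combination ("ND") does not count against
    specificity; it is simply uninformative.
    """
    if table[target][clone] != "B":
        return False
    return all(row[clone] in ("W", "ND")
               for bait, row in table.items() if bait != target)


for clone in ("clone 22", "clone 44", "clone 91"):
    print(clone, is_specific(clone, TABLE_1))
```

All three clones pass this check, matching the conclusion stated in Example 2.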

EXAMPLE 3 

Proteins coded by the various cDNA clones were prepared by in vitro transcription and translation using the phage T7-based expression vector, pET3b (Studier et al., 1990) and a commercially available kit (Promega, Madison, WI). 35S-labeled proteins coded by clones 22, 44 and 91 were incubated with protein extracts prepared from human 293 cells (Graham et al., 1977) which express abundant amounts of 19 kD protein or extracts of HeLa cells which do not express any adenovirus proteins. The interaction of the labeled proteins with the 19 kD protein then was analyzed by immunoprecipitation with an antibody specific for the 19 kD protein (Green et al., 1982). The 293 cell extract was used as a source of 19 kD protein rather than purified protein since the cell extract may be suitable to detect interaction of exogenously added proteins with 19 kD protein if the interaction occurs indirectly via other cellular proteins.

Exogenously added 35S-labeled proteins coded by both clones readily were precipitated with 19K antibodies in the presence of 293 cell extracts. Under the same binding conditions, 35S-labeled luciferase (control) was not precipitated. Similarly, no significant amounts of the various labeled proteins were precipitated from HeLa cell extracts. In all cases, immunoprecipitations with normal rabbit serum also did not precipitate any detectable levels of the proteins.

The results indicate that the proteins coded by clones 22, 44 and 91 (hereafter referred to as 19K-interacting protein (Nip) 1, Nip2 and Nip3, respectively) specifically interact with the 19 kD protein either directly or indirectly and substantiate the results obtained by the two hybrid screens.

EXAMPLE 4 

To confirm the interaction of the Nip proteins with the 19 kD protein in vivo, BSC40 cells were transfected with plasmid vectors expressing the 19 kD or various Nip proteins under the transcriptional control of the T7 promoter in a mammalian expression vector, pTM1 (Moss et al., 1990). The cells were infected with the recombinant vaccinia virus vTF7-3 expressing the T7 RNA polymerase gene (Fuerst et al., 1986) to induce the expression of proteins from the various pTM1 plasmids. To facilitate the analysis of protein interactions, the Nip proteins were tagged with an epitope corresponding to a 9 amino acid region of the haemagglutinin (HA) protein of influenza virus (Field et al., 1988). The pTM1 vector expressing only the HA epitope was used as a control.

Cells were labeled in vivo with a 35S-methionine and 35S-cysteine mixture, lysed and the lysates divided and subjected to immunoprecipitation analysis using either a monoclonal HA antibody 12CA5 (Boehringer-Mannheim, Indianapolis, IN) or a 19 kD antipeptide antibody (Green et al., 1982). The precipitated proteins were analyzed by SDS-PAGE and autoradiography to determine whether comparable levels of the 19 kD protein were present in each sample. The HA immunoprecipitates were separated by SDS-PAGE, blotted onto nitrocellulose and probed with the 19 kD antibody to detect the presence of coprecipitating 19 kD protein. A horseradish peroxidase/chemiluminescence detection system (ECL) (Amersham, Arlington Heights, IL) was used to visualize bound antibodies.

The 19 kD protein was not detected in HA immunoprecipitates from cells transfected with the pTM1-HA vector and pTM1-19kD, but clearly was seen coprecipitating with HA-tagged Nip1, Nip2 and Nip3.

Cells also were cotransfected with plasmids expressing each of the Nip proteins and either the 19 kD or the vector plasmid. Cell lysates were immunoprecipitated with the 19 kD antibody, blotted and probed with anti-HA antibody. That experimentation yielded comparable results for Nip1 and Nip3 (Nip2 is not transferred to nitrocellulose under standard blotting conditions) and provides additional support for the specificity of the interactions.

The results indicate that the 19 kD protein interacts specifically with each of the Nip proteins in an in vivo immunoprecipitation assay.

EXAMPLE 5 

To determine the subcellular localization of the 19 kD-interacting proteins and to investigate whether the proteins colocalize with the 19 kD protein within the cell, COS7 cells (Gluzman, 1981) transfected with plasmids expressing the protein coding sequences of the cDNA clones either alone or with a 19 kD-expressing plasmid were cultured and the transfected cells were analyzed by indirect immunofluorescence. To facilitate the immunofluorescence analysis of the 19 kD-interacting proteins, the proteins first were tagged with an epitope corresponding to a 9 amino acid region of the hemagglutinin (HA) protein of influenza virus (Field et al., 1988).

The cDNAs were cloned into the expression plasmid pCMV-HA which tags the proteins with an HA epitope. COS7 cells, grown on 22 mm² coverslips in 30 mm dishes, were transfected with plasmids expressing the 19 kD protein (pRc-CMV-19K) and/or each of the pCMV-HA based cDNA clones using the LipofectAMINE™ reagent (Gibco-BRL) following the manufacturer's protocols. Forty eight hours post-transfection cells were fixed with 3.7% formaldehyde in PBS and permeabilized with ice-cold methanol. Cells were double stained with mouse monoclonal anti-HA (Berkeley Antibody Co., Inc.) and rabbit polyclonal anti-19 kD peptide serum (Green et al., 1982), and visualized with goat anti-rabbit rhodamine conjugate (Cappel) and goat anti-mouse fluorescein conjugate (Pierce). Cells were observed and photographed for rhodamine (19 kD) and fluorescein (HA-tagged clones) fluorescence. The localization of each cellular protein was examined in the absence of or presence of the 19 kD protein. The 19 kD protein primarily was expressed at the nuclear envelope/endoplasmic reticulum region. Nip1 (clone 22) and Nip2 (clone 44) also appear to be concentrated primarily on the nuclear envelope region in addition to other cytoplasmic structures. Nip3 (clone 91) has a different pattern of localization, having a punctate pattern resembling the patterns exhibited by mitochondria (Alberts et al., 1989).

In cells coexpressing Nip1, Nip2 or Nip3 and the 19 kD protein, both the 19 kD and each of the cellular proteins colocalized primarily at the nuclear envelope region. Hence, the apoptosis-regulating cellular proteins localize identically with the 19 kD protein, which includes a marked shift in the localization pattern of Nip3.

EXAMPLE 6 

The two hybrid strategy provides a rapid method for genetic analysis of in vivo protein-protein interaction. A series of 19K mutants (Table 2) characterized with regard to ability to protect against cell death induced by the DNA damaging agent cisplatin and to cooperate with E1A in transformation of primary cells were used (Subramanian et al., 1993). The wild type 19K sequence is identified as SEQ ID NO:8; the 14-5 mutant as SEQ ID NO:9; the 29-0 mutant as SEQ ID NO:10; the 50-1 mutant as SEQ ID NO:11; the 75-6 mutant as SEQ ID NO:12; the 90-6 mutant as SEQ ID NO:13; the 123-4 mutant as SEQ ID NO:14; and the 146-E mutant as SEQ ID NO:15.

In Table 2 the names of the various mutants are marked above the amino acid sequence and the amino acid changes are indicated below the sequence. The actual amino acids involved in the various mutations are indicated in boldface. Z indicates a stop codon. Δ indicates deletion. The underlined sequence is homologous to conserved domain I of bcl-2 (Williams & Smith, 1993).
Table 2. 19K and Bcl-2 Mutants

E1B 19K

14-5 29-0 50-1
1 MEAWECLEDFSAVRinJ^EQSSNSTSWFWRPLWGSSQAKLVCRIKEDYKWE
AS AS AS

75-6 90-6
61 LFDSLNLGH OALFQEKVIKTI^^
AS Δ

123-4 146-E
121 HLWRAVVRHKNRLLLLSSVMAIIPTEEQQQEEARIOaiRQEQSPWNPRAGLDPRE
AS SZ

Bcl-2 (Seto et al., 1988)

42-8
1 MAHAGRSGYDNREIVMKYIHYKLSQRGYEWDAGDVGAAPPGAAPAPGPFSSQPGHTPHPA
Δ

80-6 106-5
61 ASRDPVARTSPLQTPAAPGJUUIGPMSPVPPVVHLT
Δ Δ

121 LTPFTARGCFATVVEELFRIX5VNWGRIVAFFEFGGVMCVESVNREMSPLVDN

181 LNRH^TWIQDNGGVroAFVELYGPSMRPLFDFSWLSIJCTLLSIJU.VGACITI^AYLG^
The 19K-coding regions of the mutants were cloned in the pMA424 vector and tested for ability to interact with Nip1, Nip2 and Nip3 in the two hybrid assay. The interaction was assessed by determining the level of lacZ expression by the X-gal filter blue/white color assay as well as by a quantitative ONPG-based β-galactosidase assay (Rose et al., 1990) (see Table 3).
Table 3

Interaction of cellular apoptosis-regulating proteins with 19K mutants

                         Interaction with cellular proteins (relative level of lacZ expression)
          Cell death     Nip1             Nip2             Nip3
Mutant    suppression@   X-gal   ONPG     X-gal   ONPG     X-gal   ONPG
wt        +              B       1        B       1        B       1
14-5      (+)            B       0.61     B       0.55     LB      0.16
29-0      +              B       0.33     B       0.32     B       0.19
50-1      -              W       <0.01    W       0.09     W       0.01
75-6      +              B       1.72     B       1.60     B       0.22
90-6      -              W       <0.01    W       0.04     W       0.01
123-4     -              W       0.05     LB      0.24     LB      0.10
146-E     +              B       0.87     B       0.98     B       0.28

The protein interaction studies were carried out in yeast strain GGY1::171. Results of X-gal staining are indicated as B, blue; W, white; or LB, light blue.

@ Results on suppression of cell death are based on Subramanian et al. (1993). (+) indicates positive at reduced levels.

β-Galactosidase activities were determined from 3 to 4 independent colonies selected at random and grown in liquid selective SD medium (Rose et al., 1990). The β-galactosidase activity is expressed in units described by Rose et al., 1990.



All three 19 kD-interacting proteins were negative for interaction with mutants 50-1 (amino acids 50-51) (SEQ ID NO:11) and 90-6 (amino acids 90-96) (SEQ ID NO:13). Nip1 and Nip2 interacted at a reduced level with mutant 123-4 (substitutions for residues 123-124) (SEQ ID NO:14), but interacted more efficiently with the 19K mutant 75-6 (which carries a substitution for residues 75-76) (SEQ ID NO:12) than with wild-type (wt) 19K (SEQ ID NO:8).

Nip3 also exhibited a similar pattern of interaction; however, the overall level of expression of lacZ was lower. The studies identify two critical regions of the 19 kD protein marked by mutants 50-1 (SEQ ID NO:11) and 90-6 (SEQ ID NO:13) required for interaction with the cellular apoptosis-reducing proteins. In addition, sequences around residues 123-124 (mutant 123-4) (SEQ ID NO:14) also may influence the interactions.

The pattern of interaction of the three cellular proteins with the 19 kD protein is diagrammatically illustrated in Figure 1. All the 19K mutants used make stable proteins in mammalian cells (Subramanian et al., 1993) and in yeast as GAL4(1-147)-19 kD fusion proteins. Thus, the 19K mutants 50-1 (SEQ ID NO:11) and 90-6 (SEQ ID NO:13), defective for suppression of cell death, also are defective for interaction with the cellular proteins. The interaction of the cellular proteins with mutant 123-4 (SEQ ID NO:14), which also is defective in suppression of cell death, is, in general, much reduced.
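The reading of Table 3 given above can be reproduced from the relative ONPG values. The following sketch is illustrative only: the dictionary `ONPG`, the helper names and the cutoff of 0.1 are assumptions introduced here, not values defined in the text, and entries reported as "<0.01" are recorded at their upper bound of 0.01.

```python
# Relative ONPG values from Table 3, ordered as (Nip1, Nip2, Nip3).
# Entries reported as "<0.01" are stored as 0.01 (upper bound; an assumption).
ONPG = {
    "wt":    (1.00, 1.00, 1.00),
    "14-5":  (0.61, 0.55, 0.16),
    "29-0":  (0.33, 0.32, 0.19),
    "50-1":  (0.01, 0.09, 0.01),
    "75-6":  (1.72, 1.60, 0.22),
    "90-6":  (0.01, 0.04, 0.01),
    "123-4": (0.05, 0.24, 0.10),
    "146-E": (0.87, 0.98, 0.28),
}


def interaction_defective(cutoff: float = 0.1) -> list[str]:
    """Mutants whose relative activity is below the cutoff for all three Nips."""
    return [m for m, values in ONPG.items()
            if all(v < cutoff for v in values)]


def enhanced_for_nip1_and_nip2() -> list[str]:
    """Mutants interacting more efficiently than wild type with Nip1 and Nip2."""
    return [m for m, (nip1, nip2, _nip3) in ONPG.items()
            if nip1 > 1.0 and nip2 > 1.0]


print(interaction_defective())
print(enhanced_for_nip1_and_nip2())
```

With the assumed cutoff, only mutants 50-1 and 90-6 are defective for all three proteins, and only 75-6 exceeds wild type for Nip1 and Nip2. Mutant 123-4 falls between the two groups, in line with the "much reduced" description.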

EXAMPLE 7 

Since Bcl-2 functionally can substitute for 19K during adenovirus replication, the 19 kD-interacting cellular proteins also were shown to interact with the Bcl-2 protein.

The two hybrid analysis using a pMA424-based bait expressing the human Bcl-2 protein (Hockenbery et al., 1990) was used. Plasmids pMA-Bcl2 and pAS-Bcl2 express residues 1-239 of Bcl2. Yeast cells (GGY1::171) were transformed either with the pMA424-19K bait or with the pMA424-Bcl-2 bait along with each of the activation domain-tagged cDNA clones. The expression of lacZ was determined by the X-gal filter assay as well as by the liquid ONPG β-galactosidase assay (Table 4).
Table 4

Interaction of 19 kD-interacting proteins with Bcl-2

           Relative level of lacZ expression@
           19 kD            Bcl-2
Protein    X-gal   ONPG     X-gal   ONPG
Nip1       B       1        B       1.8
Nip2       B       1        B       0.7
Nip3       B       1        B       1.5

@ Relative interaction is based on the β-galactosidase activity expressed in cells expressing the respective tagged cDNA clones and the 19K bait. B, blue.



All three clones interacted with the Bcl-2 bait. Nip1 and Nip3 in general interacted more efficiently with the Bcl-2 bait as compared with the 19 kD bait. The interaction of Nip2 was slightly lower with Bcl-2 than with 19 kD.

Proteins coded by the three cDNA clones were prepared by in vitro transcription and translation according to protocols supplied by commercial vendors and incubated with cell extracts prepared from CHO cells or CHO cells infected with an adenovirus recombinant that overexpresses the human Bcl-2 protein (Ad-Bcl2) (obtained and available from T. Subramanian and B. Tarodi, St. Louis University Medical Center, St. Louis, Missouri). The proteins were immunoprecipitated with a hamster monoclonal antibody specific for human Bcl-2 (Hockenbery et al., 1990) and analyzed by SDS-PAGE.

All three 19 kD-interacting proteins could be precipitated from extracts prepared from cells infected with the Ad-Bcl2 virus while in contrast there was no significant amount of proteins precipitated from uninfected cells.

To further substantiate the interaction of the Nip proteins with Bcl-2, BSC40 cells infected with vTF7-3 (Fuerst et al., 1986) expressing the T7 RNA polymerase gene and cotransfected with plasmids expressing Bcl-2 and one of the HA-tagged Nip proteins were labeled, lysed and analyzed essentially as described in Example 4 for the 19 kD protein. Comparable levels of Bcl-2 protein were present in each sample as determined by immunoprecipitation using the Bcl-2 monoclonal antibody 6C8 (Hockenbery et al., 1990). Each sample was subjected to immunoprecipitation using the HA monoclonal antibody 12CA5 (Boehringer-Mannheim); the precipitated proteins were separated by SDS-PAGE, blotted and probed for the presence of Bcl-2. Bcl-2 clearly was coprecipitated with HA-Nip1, HA-Nip2 and HA-Nip3.

An alternate experiment, in which the cells were cotransfected with each of the Nip proteins and either Bcl-2 or the vector plasmid and cell lysates were immunoprecipitated with antibody directed against the Bcl-2 protein and probed with anti-HA antibody, yielded comparable results for Nip1 and Nip3 (Nip2 is not transferred to nitrocellulose under standard blotting conditions) and provides additional support for the specificity of the interactions. Thus, using in vivo coimmunoprecipitation, a specific interaction between each of the Nip proteins and Bcl-2 was found. The results, together with the in vitro immunoprecipitation data, indicate that the three Nip proteins interact with Bcl-2 as well as with the 19 kD proteins.

Using Bcl-2 as the bait in the two hybrid analysis, three additional regulating proteins (Bip for Bcl-2 interacting
protein) were identified. Two of the clones (Bip1A, SEQ ID NOS:26-27 and Bip13, SEQ ID NOS:30-31) also interact with
the 19K bait, while clone Bip5, SEQ ID NOS:28-29, did not. Bip1A was tested for interaction with the 19K mutants as
described in Example 5 and exhibited a pattern of interaction similar to that seen with Nip1, Nip2 and Nip3. Interaction was
negative with mutant 90-6 (SEQ ID NO:13) and greatly diminished with mutant 50-1 (SEQ ID NO:11), as illustrated in
Figure 1.

EXAMPLE 8 

The observation that the various 19 kD-interacting proteins also interact with the Bcl-2 protein is significant since
the primary amino acid sequence of the 19 kD protein is not significantly homologous to Bcl-2 and related proteins
(reviewed by Williams and Smith, 1993). Since four cellular proteins tested (Nip1, Nip2, Nip3 and Bip1A) failed to
interact with 19K mutants 50-1 (SEQ ID NO:11) and 90-6 (SEQ ID NO:13), regions of homology corresponding to the
mutated regions of 19 kD protein must exist. Three regions of homology between the amino acid sequences
encompassing the 19K mutants (50-1 and 90-6) and Bcl-2 were identified.
Based thereon, three different deletion mutants within the Bcl-2 (SEQ ID NO:16) coding region were constructed
by oligonucleotide-directed mutagenesis using commercially available kits (Amersham) (Table 2). The mutants lack
Bcl-2 (SEQ ID NO:16) residues 42 to 48 (#42-8) (SEQ ID NO:17), 80 to 86 (#80-6) (SEQ ID NO:18) and 106 to 115
(#106-5) (SEQ ID NO:19). The mutants were cloned in pMA424 and used in the two hybrid assay. Interaction was
determined by the lacZ expression assays (Table 5).
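
The residue ranges given above can be turned into mutant protein sequences mechanically. The following is a minimal sketch only: the helper name delete_residues and the 12-residue placeholder string are hypothetical and are not the Bcl-2 sequence of SEQ ID NO:16; residue numbering is assumed to be 1-based and inclusive, as in the text.

```python
def delete_residues(protein: str, first: int, last: int) -> str:
    """Return `protein` with residues first..last removed (1-based, inclusive)."""
    if not 1 <= first <= last <= len(protein):
        raise ValueError("residue range lies outside the sequence")
    # Python slices are 0-based and end-exclusive, hence first - 1 and last.
    return protein[: first - 1] + protein[last:]


# Hypothetical placeholder sequence, used only to show the indexing.
toy = "MAHAGRTGYDNR"
mutant = delete_residues(toy, 4, 6)
print(mutant, len(toy) - len(mutant))
```

Applied to a full-length sequence, a deletion of residues 42 to 48 removes seven residues and a deletion of residues 106 to 115 removes ten.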



Table 5

Interaction of 19 kD-interacting proteins with Bcl-2 mutants

                  Relative level of lacZ expression@
             Nip1              Nip2              Nip3
Mutant    X-gal   ONPG      X-gal   ONPG      X-gal   ONPG
wt          B     1.0         B     1.0         B     1.0
42-8        W     0.16        W     0.09        W     0.19
80-6        B     1.21        B     0.80        B     2.87
106-5       W     0.01        W     0.06        W     0.11

@ Relative interaction is based on the β-galactosidase activity
expressed in cells expressing the respective tagged cDNA clones
and the Bcl-2 baits expressing the various mutants. B, blue; W,
white.



Mutants 42-8 and 106-5 were defective for interaction with the three clones tested while mutant 80-6 was not. Since
the amino acid sequences of Bcl-2 deleted in mutants 42-8 and 106-5 appear to be homologous to sequences around
19K mutants 90-6 and 50-1, respectively, the 19 kD-interacting proteins recognize bipartite sequence motifs common
to both the 19 kD and Bcl-2 proteins (Fig. 1).
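
The reading of Table 5 given above can be reproduced from the ONPG values alone. The sketch below is illustrative: the ONPG figures are copied from Table 5, but the cutoff of 0.25 relative to wild type is an assumption chosen for this example and is not a criterion stated in the specification.

```python
# Relative ONPG values from Table 5 (wild-type Bcl-2 bait = 1.0).
ONPG = {
    "wt":    {"Nip1": 1.0,  "Nip2": 1.0,  "Nip3": 1.0},
    "42-8":  {"Nip1": 0.16, "Nip2": 0.09, "Nip3": 0.19},
    "80-6":  {"Nip1": 1.21, "Nip2": 0.80, "Nip3": 2.87},
    "106-5": {"Nip1": 0.01, "Nip2": 0.06, "Nip3": 0.11},
}

THRESHOLD = 0.25  # assumed cutoff, not taken from the specification


def defective_mutants(table, threshold=THRESHOLD):
    """List baits whose activity is below the cutoff with every clone tested."""
    return [
        mutant
        for mutant, values in table.items()
        if all(v < threshold for v in values.values())
    ]


print(defective_mutants(ONPG))
```

With this cutoff the function returns mutants 42-8 and 106-5, in agreement with the white colonies scored on X-gal.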

EXAMPLE 9 

The DNA sequences of the six cDNA clones were determined by dideoxy sequencing using the Sequenase kit
(U.S. Biochemicals) of pAct-based plasmids (Durfee et al., 1993) or after subcloning into the pBluescript II KS(+) vector
(Stratagene, San Diego, CA) using Sequenase version 2 (United States Biochemical Corp.). Sequencing first was
initiated with primers complementary to sequences upstream of the cDNA cloning site in pAct or the T3 promoter of
pBluescript. Subsequently, other primers were synthesized on the basis of newly determined sequences and used for
sequencing both strands. The reading frame in relation to the GAL4 activation domain was established.
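
Once a reading frame has been fixed, the encoded amino acids follow from the standard genetic code. The sketch below is a generic illustration and not the software used in the specification; the function name translate and the frame parameter are hypothetical. The example input is the first 14 codons listed for Bip5 in the sequence table.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, frame: int = 0) -> str:
    """Translate `dna` from offset `frame` (0, 1 or 2) up to the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(frame, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unreadable codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGACGCCTGGGTCCGCTTCAGTGCTCAGAGCCAAGCCCGG"))  # MDAWVRFSAQSQAR
```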

Since the activation domain is tagged at the 5' end of the cDNA's, a fraction of cDNA's isolated by the two hybrid
system may be lacking the sequences corresponding to native 5' ends of the various mRNA's. Hence, the 5'-upstream
sequences corresponding to the three isolated cDNA clones were obtained by reverse transcription of
poly(A)-containing RNA prepared from human HeLa cells or from human placenta and PCR amplification (Frohman et al.,
1988) using two nested primers (5'RACE) using a commercially available kit (5'AmpliFINDER, Clontech) or from cDNA
clones isolated from a human B cell cDNA library (Ausubel et al., 1992).

Downstream sequences were obtained by 3'RACE (described hereinbelow) using a commercially available kit 
(GibcoBRL). The Bluescript-based cDNA clones were prepared after infection with the helper phage and sequenced 
using appropriate primers. The DNA fragments obtained by RACE were cloned and sequenced. 

The RACE analysis provided additional DNA sequences corresponding to three codons including an ATG codon
for clone 22 (Nip1) while extensive analysis did not provide additional protein coding sequences corresponding to cDNA
clones 44 (Nip2) and 91 (Nip3). The coding region of clone 44 at the 3' end (30 amino acids) was obtained by 3' RACE
analysis.

For Bip5, 139 bp of 5' sequences were obtained as follows. The two hybrid clone contained nucleotides 109 to
1542. Comparison of the Bip5 sequence with a human fetal brain cDNA clone revealed substantial sequence identity
and overlap (clone T08302, EST 06193, Adams et al., 1993) between the two clones. Clone T08302 contains a 366 bp
fragment and includes an open reading frame beginning at the ATG of the clone. T08302 was ligated to the 5' end of
the Bip5 clone following PCR amplification.
EXAMPLE 10



The amino acid sequences of Nip1 (SEQ ID NO:21), Nip2 (SEQ ID NO:23), Nip3 (SEQ ID NO:25), Bip1A (SEQ ID
NO:27), Bip5 (SEQ ID NO:29) and Bip13 (SEQ ID NO:31) based on DNA sequences are presented in Table 6. Putative
transmembrane domains of Nip1 and Nip3 are underlined.

The amino acid sequences were analyzed to determine if any similarity to other sequences existed in various
known and available data banks (GenBank, PIR and SwissProt; National Center for Biotechnology Information (NCBI))
using the FASTA and TFASTA programs (Pearson & Lipman, 1988) available in the University of Wisconsin Genetics
Computer Group (UWGCG) package and the BLAST (NCBI) (Altschul et al., 1990) algorithms. The sequences also were
analyzed by the PROSITE (UWGCG software package) program to identify functional sequence motifs.

Nip1 (SEQ ID NO:21) is a 228 amino acid protein and contains a putative membrane-spanning hydrophobic
domain. The presence of the membrane-spanning domain enables the protein to form stable associations with cellular
membranes and accounts for the observed localization of the protein to the nuclear envelope/endoplasmic reticulum
region of the cell.
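
A putative membrane-spanning segment of this kind is commonly located by scanning a protein with a hydropathy window. The specification does not state which method was used, so the following is a sketch under assumptions: it uses the Kyte-Doolittle hydropathy scale, a 19-residue window, and a hypothetical function name; the test sequence in the usage note is artificial.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KD = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5, "M": 1.9, "A": 1.8,
    "G": -0.4, "T": -0.7, "S": -0.8, "W": -0.9, "Y": -1.3, "P": -1.6,
    "H": -3.2, "E": -3.5, "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9, "R": -4.5,
}


def best_hydrophobic_window(protein: str, window: int = 19):
    """Return (1-based start, mean hydropathy) of the most hydrophobic window."""
    if len(protein) < window:
        return None
    scores = [
        sum(KD[res] for res in protein[i:i + window]) / window
        for i in range(len(protein) - window + 1)
    ]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best + 1, round(scores[best], 2)
```

For an artificial sequence consisting of 20 charged residues, 19 leucines and 20 further charged residues, the function reports the window starting at residue 21 with a mean hydropathy of 3.8.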

Sequence comparison of Nip1 (SEQ ID NO:21) indicated that the protein is not significantly homologous to other
known proteins. However, a 59 to 83 amino acid region of Nip1 has homology (29 to 36 per cent identity and 55 to 60
per cent similarity) to a conserved region (Figure 2A) within the catalytic domain of three mammalian (rat, mouse and
cow) calmodulin-dependent 3',5'-cyclic nucleotide phosphodiesterases (PDE) (SEQ ID NO:34) (Bentley et al., 1992;
Polli et al., 1992; Repaske et al., 1992).
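
Per cent identity figures of the kind quoted above are computed over an alignment. The sketch below assumes two pre-aligned strings of equal length with "-" marking gaps; the function name is hypothetical and the example strings are artificial. Per cent similarity would additionally require a substitution scoring scheme, which is not reproduced here.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Per cent identical residues over aligned, ungapped columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns in which neither sequence has a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not columns:
        return 0.0
    matches = sum(a == b for a, b in columns)
    return 100.0 * matches / len(columns)


print(percent_identity("ACDE-G", "ACDFHG"))  # 4 of 5 ungapped columns match
```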

Because Nip1 is likely to possess an enzymatic activity, Nip1 can be used as a diagnostic tool to ascertain the state
of the cell. Hence, excess or reduced Nip1 phosphodiesterase activity can be a means for monitoring the metabolic
state of a cell, tissue or organ.

Nip2 (SEQ ID NO:23) is a 314 amino acid protein. Sequence comparison revealed that a 126 amino acid region of
Nip2 shares significant homology (47 per cent identity and 66 per cent similarity; Figure 2C) to the human GTPase
activating protein RhoGAP (SEQ ID NO:35) (Barfod et al., 1993; Lancaster et al., 1994). The homology between Nip2 and
RhoGAP does not extend to the functional domain of RhoGAP and is located in an upstream region which has been
postulated to be removed by proteolysis (Barfod et al., 1993).

A scrutiny of the Nip2 sequence also revealed the presence of a putative Ca2+-binding motif (amino acids 79-90 of
SEQ ID NO:23) (Figure 2B).

Even though Nip2 does not contain a membrane-spanning hydrophobic domain, indirect immunofluorescence
analysis indicates that it is associated with cytoplasmic structures.

Nip3 (SEQ ID NO:25) is a 194 amino acid protein and contains a presumptive membrane-spanning hydrophobic
domain. Nip3 also appears to be a novel protein. However, the cDNA sequence is highly homologous to a rat cDNA
clone which encodes the rat brain calbindin-D protein (Hunziker and Schrickel, 1988). A smaller cDNA clone isolated
from human fetal brain (Adams et al., 1992) is identical to sequences within the Nip3 coding region. cDNA clone 91
(Nip3) and the fetal brain cDNA do not contain the coding sequences for calbindin. Similarly, two other cDNA clones
containing Nip3 sequences isolated from a human B cell cDNA library do not contain the calbindin coding sequences.
An extensive search for additional sequences in DNA amplified by 5'RACE of human placental RNA with primers
specific for clone 91 did not yield additional 5' sequences corresponding to calbindin coding sequences.

Thus, Nip3 is coded by a unique mRNA in human cells. That notion is in agreement with the observation that rat
calbindin cDNA hybridizes to multiple mRNA species from human membrane (Iacopino and Christakos, 1990).

All three cellular proteins contain sequence motifs designated PEST (enriched in proline, glutamic acid, serine and
threonine) (SEQ ID NO:33) sequences (Rogers et al., 1986), suggesting that the proteins may be degraded rapidly and
expressed in a stage-specific manner (Rogers et al., 1986).
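
Regions enriched in these four residues can be flagged with a simple composition scan. The sketch below is a simplification and is not the scoring procedure of Rogers et al.; the function name, the 12-residue window and the 50 per cent cutoff are assumptions made for illustration.

```python
PEST_RESIDUES = set("PEST")  # proline, glutamic acid, serine, threonine


def pest_rich_windows(protein: str, window: int = 12, min_fraction: float = 0.5):
    """Return (1-based start, fraction) for windows rich in P, E, S and T."""
    hits = []
    for i in range(len(protein) - window + 1):
        segment = protein[i:i + window]
        fraction = sum(res in PEST_RESIDUES for res in segment) / window
        if fraction >= min_fraction:
            hits.append((i + 1, round(fraction, 2)))
    return hits
```

A candidate region found in this way would still need to be checked against the published PEST criteria before being treated as a degradation signal.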

Bip1A (SEQ ID NO:27) is a 160 amino acid protein.

Bip5 (SEQ ID NO:29) is a 259 amino acid protein and does not exhibit significant homology to other known
proteins.

On sequencing, it was determined that clone Bip13 (SEQ ID NO:31) bears a high degree of homology with CD22
(Wilson et al., 1991; Stamenkovic and Seed, 1990). CD22 is known to increase intracellular calcium and may be a B
cell activator; however, the role of CD22 in apoptosis heretofore was unknown.
Amino Acid and Nucleic Acid Sequences



A. Nip1



AGTCCCCAACATGGCGGCTCCCCAAGACGTCCACGTCCGGATCTGTAACCAAGAGATTGT 
MAAPQDVHVRICNQEIV 
CAAATTTGACCTGGAGGTGAAGGCGCTTATTCAGGATATCCGTGATTGTTCAGGACCCTT 

KFDLEVKALIQDIRDCSGPL 
AAGTGCTCTTACTGAACTGAATACTAAAGTAAAAGAGAAATTTCAACAGTTGCGTCACAG

SALTELNTKVKEKFQQLRHR 
AATACAGGACCTGGAGCAGTTGGCTAAAGAGCAAGACAAAGAATCAGAGAAACAACTTCT 

IQDLEQLAKEQDKESEKQLL 
ACTCCAGGAAGTGGAGAATCACAAAAAGCAGATGCTCAGCAATCAGGCCTCATGGAGGAA 

LQEVENHKKQMLSNQASWRK 
AGCTAATCTCACCTGCAAAATTGCAATCGACAATCTAGAGAAAGCAGAACTTCTTCAGGG 

ANLTCKIAIDNLEKAELLQG 
AGGAGATCTCTTAAGGCAAAGGAAAACCACCAAAGAGAGCCTGGCCCAGACATCCAGTAC 

GDLLRQRKTTKESLAQTSST 
CATCACTGAGAGCCTCATGGGGATCAGCAGGATGATGGCCCAGCAGGTCCAGCAGAGCGA 

ITESLMGISRMMAQQVQQSE 
GGAGGCCATGCAGTCTCTAGTCACTTCTTCACGAACGATCCTGGATGCAAATGAAGAATT 

EAMQSLVTSSRTILDANEEF 
TAAGTCCATGTCGGGCACCATCCAGCTGGGCCGGAAGCTTATCACAAAATACAATCGCCG 

KSMSGTIQLGRKLITKYNRR 
GGAGCTGACGGACAAGCTTCTCATCTTCCTTGCGCTACGCCTGTTTCTTGCTACGGTCCT
ELTDKLLIFLALRLFLATVL

CTATATTGTGAAAAAGCGGCTCTTTCCATTTTTGTGAGATCCCAAAGGTGCCAGTTCTGG

YIVKKRLFPFL
CCCTTTCAGCTCOTGTTTCAGGATCTGTCCTGGTTCCTGAGCTCTAGGCTGCTAAGCTGA 

GCCACACACCCCTCCGTTTTGCACCAGTTGCCTGCAGGTTGGATGGAACACAGTGCCCCA 

CTTTTCTGCAAGTAGCTGGCTTGTAAAGGGTGAACAGAGCCATGGGAGGAAGGTCTCGCA 

TTGGGATGCCGCCCTGGGGACATACGAACCGCCTCCTTCCACCATTGTGCACTATGGGAG 

GCCGCTGCTGCGTGGAGCACTTAAAGTCCAGCCTCCAGGACCGGATGCCCCTCCTGTCTC 

CCGCTCCCATCGTGCCCTTAAATGCCAGATCTGGTGGAGGGAAGAGAGAAGAGGTAGGAA 

GAAAGGTGATGAAAACTC CTG 



B. Nip2 



CTGCGGCCGGGGGATTGGGCCGGGGTCTCCACCGCCGACCGAG 
GGGAGCGGCGTCCGCTCGGCCCTGCTTTTTGCGACCTGCCGTCAGCCCCACGTCGCCGGC 
CTGGAGGGGCGAAGAGGACGAGGGGCGACGAAGGCCCAAGGCTTCCTCCGGGGACATTGG 
CTCCCTGGATTATCAAGCAGTTTGTAGTTGACATTGAATCCAGGCTGAGGATGGAAGGTG 

MEG 

TGGAACTTAAAGAAGAATGGCAAGATGAAGATTTTCCGATACCTTTACCAGAAGATGATA 

VELKEEWQDEDFPIPLPEDD

GTATTGAAGCAGATATACTAGCTATAACTGGACCAGAGGACCAGCCTGGCTCACTAGAAG 

SIEADILAITGPEDQPGSLE

TTAATGGAAATAAAGTGAGAAAGAAACTAATGGCTCCAGACATTAGCCTGACACTGGATC 

VNGNKVRKKLMAPDISLTLD

CTAGTGATGGCTCTGTATTGTCAGATGATTTGGATGAAAGTGGGGAGATTGACTTAGATG 

PSDGSVLSDDLDESGEIDLD

GCTTAGACACACCGTCAGAGAATAGTAATGAGTTTGAGTGGGAAGATGATCTTCCAAAAC 

GLDTPSENSNEFEWEDDLPK 

CCAAGACTACTGAAGTAATTAGGAAAGGCTCAATTACTGAATACACAGCAGCAGAGGAAA 

PKTTEVIRKGSITEYTAAEE 

AAGAAGATGGACGACGCTGGCGTATGTTCAGGATTGGAGAACAGGACCACAGGGTTGATA 

KEDGRRWRMFRIGEQDHRVD 

TGAAGGCAATTGAACCCTATAAAAAAGTTATCAGCCATGGGGGATATTATGGGGATGGAT 

MKAIEPYKKVISHGGYYGDG

TAAATGCCATTGTTGTATTTGCTGTCTGTTTCATGCCTGAAAGTAGTCAGCCTAACTATA 

LNAIVVFAVCFMPESSQPNY 

GATACCTGATGGACAATCTTTTTAAATATGTTATTGGCACTTTGGAGCTATTAGTAGCAG 

RYLMDNLFKYVIGTLELLVA 

AAAACTACATGATAGTTTATTTAAATGGTGCAACAACTCGAAGAAAAATGCCCAGTCTGG 

ENYMIVYLNGATTRRKMPSL 

GATGGCTCAGGAAATGTTATCAGCAAATTGATAGAAGGTTACGGAAAAATCTAAAATCCC 

GWLRKCYQQIDRRLRKNLKS

TAATCATTGTACATCCTTCTTGGTTTATCAGAACACTTCTGGCTGTTACAAGACCATTTA 

LIIVHPSWFIRTLLAVTRPF

TTAGCTCGAAATTCAGCCAAAAAATTAGATATCTGTTTAATTTGGCAGAACTAGCAGAAC 

ISSKFSQKIRYVFNLAELAE 

TTGTCCCCATGGAATACGTTGGCATACCAGAATGCATAAAACAAGTTGATCAAGAACTTA 

LVPMEYVGIPECIKQVDQEL 

ATGGAAAACAAGATGAACCGAAAAATGAACAGTAAGTTTGGCATCTAGTCCAAACAAGAC 

NGKQDEPKNEQ 

TGAAGAATGTGCTGATGGAGCAGTGCTGTTTCTGCATTCATAATGCATTTATTGGCCAT 
ATTTTTATGTAACCTGTTACAAAATAGACTTGACTX™

AGGGACTGTTCACTGCTGTACTGGTTTGCAAATTTCTTGAATTTAGCTCTTTATGCTAAL 10, 1 
ATTATTATCATTTTATAT 

C. Nip3 

ggagcgcccgggatgcaggac^agagcctgcaggg^ 

AATA^T^AACGG^ 

KrwrNrGSVPASVSIYNGDMb 
A^y^ATACTGCTGGACGCACAGCATGAGTCTGGACGGAGTAGCTCCAAGAGCTCTCACTGT 

ir j j r DAOHESGRSSSKSSHC 

gacIgcccacctcgctcgcagacaccacaagataccaacagggcttctgaaaca^ 

ncppRSOTPQDTNRASE TU1 

?atagcattggagagaaaaacagctcacagtctgaggaagatgatattgaaagaagga^ 

ticTrFKNSSQSEEDDlERK^ 

gaagttgaaagcatcttgaagaaaaactcagattggatatgggattggtcaagtcggccg 

wvpqTLKKNSDWlWDWSfaKf 

gaaaatmtccccccaaggagttcctctttaaacacc 

pmt ppKEFLFKHPKRTATL t> 

atgaggaacLgagcgtcatgaag^ 

ttcctt£c*tc^ 

LgcgtctgacaLct^^ 

?tagtggat?aotc^^ 

ttcgtggctcgagagactagaatcgcagatacgaaaaccccgcagc 



D. Bip1A

cagcatcgccgccgccagaggagaaatgtctgaagta 

M S E V 

agacccctctccagagacatcttgatggagaccctcctgtatgagcagctcct^ 

cwacca^Lgttcttwcatgactgactctgaagaggacct^^ 

t?cgIttctttggaatgcatgga«^cagtgacgcattggccctgcggctggc CT 

t? n q t FCMEGSDALALRLA^x 

g L;acgaga^cg^^ 
aSgcacagcc^tc^g?^^ 



m H S L G L A F I Y DQTEDIRDVL 

agaagtt?cat<*ac^^ 
ccgaaccccgtotcctgggtgtcctgcgaaca^ 

G cgc?gc?gc?gc.cgctcctcagcgggggcctgcacctgctgctcaagtgaggccccggc 
ALLLPLLSGGLHLLLK
GGCTCAGGGCGTGGCTGGCCCCACCCCCATGACCACTGCCCTGGAGGTGGCGGCCTGCTGCT 
GTTATCTTTTTAACTGTTTTCTCATGATGCCTTTTTATATTTAAACCCCGAGATAGTG 
CTGGAACACTGCTGAGGTTTTATACTCAGGTTTTTTGTOT 
TTTTTTCTAAAAGATGAATTCCTATGGCTCTGCAATTGTCACCGGTTAACT 
GCCCAGGAAGAGCCATTCACTCCTGCCCCTGCCCACACGGCAGGTAGCAGGGGGA 
GGTCACACCCCTGTGTGATATGTGATGCCCTCGGCAAAGAATCTACTGGAATAGATTCCG 
AGGAGCAGGAGTGCTCAATAAAATGTTGGTTTCCAGCAAAAAAAAAAAAAAGGCCTTCGT 
GGCCTCGAG 



E. Bip5 





ATG GAC GCC TGG GTC CGC TTC AGT GCT CAG AGC CAA GCC CGG
 M   D   A   W   V   R   F   S   A   Q   S   Q   A   R

GAG CGG CTG TGT AGG GCC GCC CAG TAT GCT TGC TCT CTT CTT GGC
 E   R   L   C   R   A   A   Q   Y   A   C   S   L   L   G

CAT GCG CTG CAG AGG CAT GGA GCC AGT CCT GAG TTA CAG AAA CAG
 H   A   L   Q   R   H   G   A   S   P   E   L   Q   K   Q

ATT CGA CAA CTG GAG AGC CAC CTG AGC CTT GGA AGA AAG CTT CTA
 I   R   Q   L   E   S   H   L   S   L   G   R   K   L   L

CGC CTG GGT AAC TCA GCA GAT GCC CTT GAG TCA GCC AAA AGA GCT
 R   L   G   N   S

CTA ATT TCT AGA










EP 0 733 706 A2 



F. Bip13

AGTCAAGTCGAGTTTCTTTGCATGTCACTGGCCAATCCTCTT 
SQVEFLCMSLANPL 
CCAACAAATTACACGTGGTACCACAATGGGAAAGAAATGCAGGGAAGGACAGAGGAGAAA 

PTNYTWYHNGKEMQGRTEEK 
GTCCACATCCCAAAGATCCTCCCCTGGCACGCTGGGACTTATTCCTGTGTGGCAGAAAAC 

VHIPKILPWHAGTYSCVAEN
ATTCTTGGTACTGGACAGAGGGGCCCGGGAGCTGAGCTGGATGTCCAGTATCCTCCCAAG 

ILGTGQRGPGAELDVQYPPK 
AAGGTGACCACAGTGATTCAAAACCCCATGCCGATTCGAGAAGGAGACACAGTGACCCTT 

KVTTVIQNPMPIREGDTVTL 
TCCTGTAACTACAATTCCAGTAACCCCAGTGTTACCCGGTATGAATGGAAACCCCATGGC 

SCNYNSSNPSVTRYEWKPHG
GCCTGGGAGGAGCCATCGCTTGGGGTGCTGAAGATCCAAAACGTTGGCTGGGACAACACA 

AWEEPSLGVLKIQNVGWDNT 
ACCATCGCCTGCGCACGTTGTAATAGTTGGTGCTCGTGGGCCTCCCCTGTCGCCCTGAAT 

TIACARCNSWCSWASPVALN 
GTCCAGTATGCCCCCCGAGACGTGAGGGTCCGGAAAATCAAGCCCCTTTCCGAGATTCAC 

VQYAPRDVRVRKIKPLSEIH 
TCTGGAAACTCGGTCAGCCTCCAATGTGACTTCTCAAGCAGCCACCCCAAAGAAGTCCAG 

SGNSVSLQCDFSSSHPKEVQ 
TTCTTCTGGGAGAAAAATGGCAGGCTTCTGGGGAAAGAAAGCCAGCTGAATTTTGACTCC 

FFWEKNGRLLGKESQLNFDS 
ATCTCCCCAGAAGATGCTGGGAGTTACAGCTGCTGGGTGAACAACTCCATAGGACAGACA 

ISPEDAGSYSCWVNNSIGQT 
GCGTCCAAGGCCTGGACACTTGAAGTGCTGTATGCACCCAGGAGGCTGCGTGTGTCCATG 

ASKAWTLEVLYAPRRLRVSM 
AGCCCGGGGGACCAAGTGATGGAGGGGAAGAGTGCAACCCTGACCTGTGAGAGTGACGCC 

SPGDQVMEGKSATLTCESDA 
AACCCTCCCGTCTCCCACTACACCTGGTTTGACTGGAATAACCAAAGCCTCCCCCACCAC 

NPPVSHYTWFDWNNQSLPHH 
AGCCAGAAGCTGAGATTGGAGCCGGTGAAGGTCCAGCACTCGGGTGCCTACTGGTGCCAG 

SQKLRLEPVKVQHSGAYWCQ 
GGGACCAACAGTGTGGGCAAGGGCCGTTCGCCTCTCAGCACCCTTACTGTCTACTATAGC 

GTNSVGKGRSPLSTLTVYYS 
CCGGAGACCATCGGCAGGCGAGTGGCTGTGGGACTCGGGTCCTGCCTCGCCATCCTCATC 

PETIGRRVAVGLGSCLAILI 
CTGGCAATCTGTGGGCTCAAGCTCCAGCGACGTTGGAAGAGGACACAGAGCCAGCAGGGG 

LAICGLKLQRRWKRTQSQQG 
CTTCAGGAGAATTCCAGCGGCCAGAGCTTCTTTGTGAGGAATAAAAAGGTTAGAAGGGCC 

LQENSSGQSFFVRNKKVRRA
CCCCTCTCTGAAGGCCCCCACTCCCTGGGATGCTACAATCCAATGATGGAAGATGGCATT 

PLSEGPHSLGCYNPMMEDGI 
AGCTACACCACCCTGCGCTTTCCCGAGATGAACATACCACGAACTGGAGATGCAGAGTCC 

SYTTLRFPEMNIPRTGDAES 
TCAGAGATGCAGAGACCTCCCCGGACCTGCGATGACACGGTCACTTATTCAGCATTGCAC 

SEMQRPPRTCDDTVTYSALH 
AAGCGCCAAGTGGGCGACTATGAGAACGTCATTCCAGATTTTCCAGAAGATGAGGGGATT 
KRQVGDYENVIPDFPEDEGI 
CATTACTCAGAGCTGATCCAGTTTGGGGTCGGGGAGCGGCCTCAGGCACAAGAAAATGTG

HYSELIQFGVGERPQAQENV 
GACTATGTGATCCTCAAACATTGACACTGGATGGGCTGCAGCAGAGGCACTGGGGGCAGC 

DYVILKH
GGGGGCCAGGGAAGTCCCCGAGTTTCCCCAGACACCGCCACATGGCTTCCTCCTGCGTGC 
ATGTGCGCACACACACACACACACGCACAC^CACACACACACACTCACTGCGGAGAACCT 
TGTGCCTGGCTCAGAGCCAGTCTTTTTGGTGAGGGTAACCCCAAACCTCCAAAACTC 
CCCCTGTTCTCTTCCACTCTCCTTGCTACCCAGAAATCATCTAAATACCT 
GCACACCTCCCCTGCCCCACCAGCCCACTGGCCATCTCCACCCGGAGCTGCTGTGTCCTC 
TGGATCTGCTCGTCATTTTCCTTCCCTTCTCCATCTCTCTO 

ACATCCCCACTCACGAATATTATGCCCAGTTTCTGCCTCTGAGGGAAAGCCCAGAAAAGG 
ACAGAAACGAAGTAGAAAGGGGCCCAGTCCTGGCCTGGCTTCTCCTTTGGAAGTGAGGCA 
TTGCACGGGGAGACGTACGTATCAGCGGCCCCTTGACTCTGGGGACTCCGGGTTTGAGAT 
GGACACACTGGTGTGGATTAACCTGCCAGGGAGACAGAGCTCACAATAAAAATGGCTCAG 
ATGCCACTTCAAAGAAAAAAAAAA



EXAMPLE 11 

The cellular apoptosis-regulating proteins also interact with the BHRF-1 protein of Epstein-Barr virus. BHRF-1
(Baer et al., 1984; Pearson et al., 1987) shares significant homology with the Bcl-2 protein (Cleary et al., 1986; Williams
and Smith, 1993). The BHRF-1 protein can protect against cell death induced by certain apoptosis-inducing stimuli
such as serum depletion and DNA damaging agents (Henderson et al., 1993). Sequences homologous to the 19
kD motifs are present in the BHRF-1 protein. The two domains of the 19 kD and Bcl-2 proteins required for interaction
with the cellular proteins are absent in Bax, a homolog of Bcl-2 (Oltvai et al., 1993). The absence of the two domains in
Bax, which has a cell death promoting activity, supports the observation that the two domains are important for
mediating cell survival by the 19 kD and Bcl-2 proteins.

The 19 kD protein of Ad2 and Ad5 contains a region (residues 60-69) (see Table 1) homologous to a conserved
domain (termed domain I) present in various Bcl-2-related proteins, including Bax (Williams and Smith, 1993; Oltvai et
al., 1993). Nip1, Nip2 and Nip3 do not interact with human Bax, which contains domain I, in the yeast two hybrid assay,
suggesting that the conserved domain I may not be essential for interaction of Nip1, Nip2 and Nip3. However, Bcl-2 can
suppress cell death promoted by Bax, possibly by forming Bax-Bcl-2 heterodimers (Oltvai et al., 1993).

In the nematode C. elegans, apoptotic cell death is inhibited by a survival promoting gene designated ced-9
(Hengartner et al., 1992). The ced-9 gene antagonizes the activity of two nematode cell death-inducing genes, ced-3 and
ced-4, which are activated during programmed cell death (Yuan and Horvitz, 1990). Bcl-2 can substitute functionally for
ced-9 in C. elegans (Vaux et al., 1991).

The mammalian IL-1β-converting enzyme (ICE), a homolog of ced-3 (Yuan et al., 1993; Miura et al., 1993), induces
cell death in mammalian cells. The activity of ICE can be suppressed by Bcl-2 (Miura et al., 1993), although it is not
known whether there is any physical interaction between ICE and Bcl-2.

EXAMPLE 12 

The homology between Nip2 and Rho GTPase-activating protein (RhoGAP) raises the possibility that the signal
transduction pathway may play a key role in counteracting the cell death inducing stimuli. In this context, it has recently
been reported that the Bcl-2 protein associates with R-ras (Fernandez-Sarabia & Bischoff, 1993), a member of the Ras
superfamily. Nip2 may modulate the activity of RhoGAP through alternate mechanisms such as formation of
RhoGAP-Nip2 heterodimers. Because the homology between Nip2 and RhoGAP is located upstream of the N-terminus of
active RhoGAP (Garrett et al., 1991), Nip2 might affect the activity of RhoGAP by interfering with the processing of RhoGAP.

Nip2 has a putative Ca2+-binding motif. Since intracellular Ca2+ appears to be an important mediator of apoptotic
cell death, it is possible that the activity of Nip2 may be modulated by Ca2+. Hence, interactions between the 19 kD
protein and Nip2 could remove the influence thereof on RhoGAP activity, thus affecting p21rho and potentially other
members of the ras superfamily.
SEQUENCE LISTING






(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: St. Louis University

(B) STREET: 1402 South Grand Blvd.

(C) CITY: St. Louis

(D) STATE: MO 

(E) COUNTRY: USA 

(F) ZIP: 63104 

(ii) TITLE OF INVENTION: APOPTOSIS-REGULATING PROTEINS
(iii) NUMBER OF SEQUENCES: 35 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unassigned 

(B) FILING DATE: Concurrently Herewith 

(C) CLASSIFICATION:

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/408,095 

(B) FILING DATE: 21-MAR-1995

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Ala Ala Pro Ala Pro Gly Phe Phe Ser
1 5
(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Tyr Arg Arg Asp Phe Ala Glu
1 5

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Tyr Lys Trp Glu Phe Glu Glu
1 5

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Ala Ala Ala Ala Val Ala Phe Leu Ser
1 5



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 2

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 3

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 4

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 6

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Tyr Xaa Xaa Xaa Phe Xaa Glu
1 5
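
The motif of SEQ ID NO:5 coincides with a position-by-position comparison of SEQ ID NO:2 and SEQ ID NO:3. The listing does not say how the motif was derived, so the sketch below only assumes that rule: a residue is kept where the peptides agree and replaced by Xaa where they differ. The function name is hypothetical.

```python
def consensus(*peptides: str) -> str:
    """Position-wise consensus of three-letter-code peptides; Xaa where they differ."""
    residues = [peptide.split() for peptide in peptides]
    if len({len(r) for r in residues}) != 1:
        raise ValueError("peptides must have the same number of residues")
    return " ".join(
        column[0] if len(set(column)) == 1 else "Xaa"
        for column in zip(*residues)
    )


seq_id_2 = "Tyr Arg Arg Asp Phe Ala Glu"
seq_id_3 = "Tyr Lys Trp Glu Phe Glu Glu"
print(consensus(seq_id_2, seq_id_3))  # Tyr Xaa Xaa Xaa Phe Xaa Glu
```

The variable positions produced by this comparison (2, 3, 4 and 6) are the same positions annotated as modified sites for SEQ ID NO:5.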



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 3

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 7

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

Ala Ala Xaa Ala Xaa Phe Xaa Ser
1 5

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Ala Ala Pro Ala Pro Gly Ile Phe Ser
1 5

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Trp Arg Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Glu Glu Gln Gln Gln Glu Glu Ala Arg Arg Arg Arg Arg Gln
145 150 155 160

Glu Gln Ser Pro Trp Asn Pro Arg Ala Gly Leu Asp Pro Arg Glu
165 170 175

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Ala Ser Leu
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Trp Arg Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Glu Glu Gln Gln Gln Glu Glu Ala Arg Arg Arg Arg Arg Gln
145 150 155 160

Glu Gln Ser Pro Trp Asn Pro Arg Ala Gly Leu Asp Pro Arg Glu
165 170 175

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Ala Ser Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Trp Arg Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Glu Glu Gln Gln Gln Glu Glu Ala Arg Arg Arg Arg Arg Gln
145 150 155 160

Glu Gln Ser Pro Trp Asn Pro Arg Ala Gly Leu Asp Pro Arg Glu
165 170 175

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Ala Ser Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Trp Arg Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Glu Glu Gln Gln Gln Glu Glu Ala Arg Arg Arg Arg Arg Gln
145 150 155 160

Glu Gln Ser Pro Trp Asn Pro Arg Ala Gly Leu Asp Pro Arg Glu
165 170 175
(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Ala Ser Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Trp Arg Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Glu Glu Gln Gln Gln Glu Glu Ala Arg Arg Arg Arg Arg Gln
145 150 155 160

Glu Gln Ser Pro Trp Asn Pro Arg Ala Gly Leu Asp Pro Arg Glu
165 170 175

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu 
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ser Phe Ile Lys Asp Lys Trp
85 90 95

Ser Glu Glu Thr His Leu Ser Gly Gly Tyr Leu Leu Asp Phe Leu Ala
100 105 110

Met His Leu Trp Arg Ala Val Val Arg His Lys Asn Arg Leu Leu Leu
115 120 125

Leu Ser Ser Val Arg Pro Ala Ile Ile Pro Thr Glu Glu Gln Gln Gln
130 135 140

Glu Glu Ala Arg Arg Arg Arg Arg Gln Glu Gln Ser Pro Trp Asn Pro
145 150 155 160

Arg Ala Gly Leu Asp Pro Arg Glu 
165 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu 
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Ala Ser Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Glu Glu Gln Gln Gln Glu Glu Ala Arg Arg Arg Arg Arg Gln
145 150 155 160

Glu Gln Ser Pro Trp Asn Pro Arg Ala Gly Leu Asp Pro Arg Glu
165 170 175

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Met Glu Ala Trp Glu Cys Leu Glu Asp Phe Ser Ala Val Arg Asn Leu 
1 5 10 15

Leu Glu Gln Ser Ser Asn Ser Thr Ser Trp Phe Trp Arg Phe Leu Trp
20 25 30

Gly Ser Ser Gln Ala Lys Leu Val Cys Arg Ile Lys Glu Asp Tyr Lys
35 40 45

Trp Glu Phe Glu Glu Leu Leu Lys Ser Cys Gly Glu Leu Phe Asp Ser
50 55 60

Leu Asn Leu Gly His Gln Ala Leu Phe Gln Glu Lys Val Ile Lys Thr
65 70 75 80

Leu Asp Phe Ser Thr Pro Gly Arg Ala Ala Ala Ala Val Ala Phe Leu
85 90 95

Ser Phe Ile Lys Asp Lys Trp Ser Glu Glu Thr His Leu Ser Gly Gly
100 105 110

Tyr Leu Leu Asp Phe Leu Ala Met His Leu Trp Arg Ala Val Val Arg
115 120 125

His Lys Asn Arg Leu Leu Leu Leu Ser Ser Val Arg Pro Ala Ile Ile
130 135 140

Pro Thr Ser 
145 

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 239 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Ala His Ala Gly Arg Ser Gly Tyr Asp Asn Arg Glu Ile Val Met
1 5 10 15

Lys Tyr Ile His Tyr Lys Leu Ser Gln Arg Gly Tyr Glu Trp Asp Ala
20 25 30

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly Phe
35 40 45

Phe Ser Ser Gln Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp
50 55 60

Pro Val Ala Arg Thr Ser Pro Leu Gln Thr Pro Ala Ala Pro Gly Ala
65 70 75 80

Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Thr
85 90 95

Leu Arg Gln Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Arg Asp Phe
100 105 110

Ala Glu Met Ser Ser Gln Leu His Leu Thr Pro Phe Thr Ala Arg Gly
115 120 125

Cys Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp
130 135 140

Gly Arg Ile Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu
145 150 155 160

Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn Ile Ala Leu Trp
165 170 175

Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp Ile Gln Asp Asn
180 185 190

Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr Gly Pro Ser Met Arg Pro
195 200 205

Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys Thr Leu Leu Ser Leu Ala
210 215 220

Leu Val Gly Ala Cys Ile Thr Leu Gly Ala Tyr Leu Gly His Lys
225 230 235

(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 232 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala His Ala Gly Arg Ser Gly Tyr Asp Asn Arg Glu Ile Val Met
1 5 10 15

Lys Tyr Ile His Tyr Lys Leu Ser Gln Arg Gly Tyr Glu Trp Asp Ala
20 25 30

Gly Asp Val Gly Ala Ala Pro Pro Gly Phe Ser Ser Gln Pro Gly His
35 40 45

EP 0 733 706 A2 



Thr Pro His Pro Ala Ala Ser Arg Asp Pro Val Ala Arg Thr Ser Pro
50 55 60

Leu Gln Thr Pro Ala Ala Pro Gly Ala Ala Ala Gly Pro Ala Leu Ser
65 70 75 80

Pro Val Pro Pro Val Val His Leu Thr Leu Arg Gln Ala Gly Asp Asp
85 90 95

Phe Ser Arg Arg Tyr Arg Arg Asp Phe Ala Glu Met Ser Ser Gln Leu
100 105 110

His Leu Thr Pro Phe Thr Ala Arg Gly Cys Phe Ala Thr Val Val Glu
115 120 125

Glu Leu Phe Arg Asp Gly Val Asn Trp Gly Arg Ile Val Ala Phe Phe
130 135 140

Glu Phe Gly Gly Val Met Cys Val Glu Ser Val Asn Arg Glu Met Ser
145 150 155 160

Pro Leu Val Asp Asn Ile Ala Leu Trp Met Thr Glu Tyr Leu Asn Arg
165 170 175

His Leu His Thr Trp Ile Gln Asp Asn Gly Gly Trp Asp Ala Phe Val
180 185 190

Glu Leu Tyr Gly Pro Ser Met Arg Pro Leu Phe Asp Phe Ser Trp Leu
195 200 205

Ser Leu Lys Thr Leu Leu Ser Leu Ala Leu Val Gly Ala Cys Ile Thr
210 215 220

Leu Gly Ala Tyr Leu Gly His Lys 
225 230 

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Ala His Ala Gly Arg Ser Gly Tyr Asp Asn Arg Glu Ile Val Met
1 5 10 15

Lys Tyr Ile His Tyr Lys Leu Ser Gln Arg Gly Tyr Glu Trp Asp Ala
20 25 30

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly Phe
35 40 45

Phe Ser Ser Gln Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp
50 55 60

Pro Val Ala Arg Thr Ser Pro Leu Gln Thr Pro Ala Ala Pro Gly Ser
65 70 75 80

Pro Val Pro Pro Val Val His Leu Thr Leu Arg Gln Ala Gly Asp Asp
85 90 95

Phe Ser Arg Arg Tyr Arg Arg Asp Phe Ala Glu Met Ser Ser Gln Leu
100 105 110

His Leu Thr Pro Phe Thr Ala Arg Gly Cys Phe Ala Thr Val Val Glu
115 120 125

Glu Leu Phe Arg Asp Gly Val Asn Trp Gly Arg Ile Val Ala Phe Phe
130 135 140

Glu Phe Gly Gly Val Met Cys Val Glu Ser Val Asn Arg Glu Met Ser
145 150 155 160

Pro Leu Val Asp Asn Ile Ala Leu Trp Met Thr Glu Tyr Leu Asn Arg
165 170 175

His Leu His Thr Trp Ile Gln Asp Asn Gly Gly Trp Asp Ala Phe Val
180 185 190

Glu Leu Tyr Gly Pro Ser Met Arg Pro Leu Phe Asp Phe Ser Trp Leu
195 200 205

Ser Leu Lys Thr Leu Leu Ser Leu Ala Leu Val Gly Ala Cys Ile Thr
210 215 220

Leu Gly Ala Tyr Leu Gly His Lys 
225 230 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Met Ala His Ala Gly Arg Ser Gly Tyr Asp Asn Arg Glu Ile Val Met
1 5 10 15

Lys Tyr Ile His Tyr Lys Leu Ser Gln Arg Gly Tyr Glu Trp Asp Ala
20 25 30

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly Phe
35 40 45

Phe Ser Ser Gln Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp
50 55 60

Pro Val Ala Arg Thr Ser Pro Leu Gln Thr Pro Ala Ala Pro Gly Ala
65 70 75 80

Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Thr
85 90 95

Leu Arg Gln Ala Gly Asp Asp Phe Ser Ser Ser Gln Leu His Leu Thr
100 105 110

Pro Phe Thr Ala Arg Gly Cys Phe Ala Thr Val Val Glu Glu Leu Phe
115 120 125

Arg Asp Gly Val Asn Trp Gly Arg Ile Val Ala Phe Phe Glu Phe Gly
130 135 140

Gly Val Met Cys Val Glu Ser Val Asn Arg Glu Met Ser Pro Leu Val
145 150 155 160

Asp Asn Ile Ala Leu Trp Met Thr Glu Tyr Leu Asn Arg His Leu His
165 170 175

Thr Trp Ile Gln Asp Asn Gly Gly Trp Asp Ala Phe Val Glu Leu Tyr
180 185 190

Gly Pro Ser Met Arg Pro Leu Phe Asp Phe Ser Trp Leu Ser Leu Lys
195 200 205

Thr Leu Leu Ser Leu Ala Leu Val Gly Ala Cys Ile Thr Leu Gly Ala
210 215 220

Tyr Leu Gly His Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 11..694



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AGTCCCCAAC ATG GCG GCT CCC CAA GAC GTC CAC GTC CGG ATC TGT AAC 49
Met Ala Ala Pro Gln Asp Val His Val Arg Ile Cys Asn
1 5 10

CAA GAG ATT GTC AAA TTT GAC CTG GAG GTG AAG GCG CTT ATT CAG GAT 97
Gln Glu Ile Val Lys Phe Asp Leu Glu Val Lys Ala Leu Ile Gln Asp
15 20 25

ATC CGT GAT TGT TCA GGA CCC TTA AGT GCT CTT ACT GAA CTG AAT ACT 145
Ile Arg Asp Cys Ser Gly Pro Leu Ser Ala Leu Thr Glu Leu Asn Thr
30 35 40 45

AAA GTA AAA GAG AAA TTT CAA CAG TTG CGT CAC AGA ATA CAG GAC CTG 193
Lys Val Lys Glu Lys Phe Gln Gln Leu Arg His Arg Ile Gln Asp Leu
50 55 60

GAG CAG TTG GCT AAA GAG CAA GAC AAA GAA TCA GAG AAA CAA CTT CTA 241
Glu Gln Leu Ala Lys Glu Gln Asp Lys Glu Ser Glu Lys Gln Leu Leu
65 70 75

CTC CAG GAA GTG GAG AAT CAC AAA AAG CAG ATG CTC AGC AAT CAG GCC 289
Leu Gln Glu Val Glu Asn His Lys Lys Gln Met Leu Ser Asn Gln Ala
80 85 90

TCA TGG AGG AAA GCT AAT CTC ACC TGC AAA ATT GCA ATC GAC AAT CTA 337
Ser Trp Arg Lys Ala Asn Leu Thr Cys Lys Ile Ala Ile Asp Asn Leu
95 100 105

GAG AAA GCA GAA CTT CTT CAG GGA GGA GAT CTC TTA AGG CAA AGG AAA 385
Glu Lys Ala Glu Leu Leu Gln Gly Gly Asp Leu Leu Arg Gln Arg Lys
110 115 120 125

ACC ACC AAA GAG AGC CTG GCC CAG ACA TCC AGT ACC ATC ACT GAG AGC 433
Thr Thr Lys Glu Ser Leu Ala Gln Thr Ser Ser Thr Ile Thr Glu Ser
130 135 140

CTC ATG GGG ATC AGC AGG ATG ATG GCC CAG CAG GTC CAG CAG AGC GAG 481
Leu Met Gly Ile Ser Arg Met Met Ala Gln Gln Val Gln Gln Ser Glu
145 150 155

GAG GCC ATG CAG TCT CTA GTC ACT TCT TCA CGA ACG ATC CTG GAT GCA 529
Glu Ala Met Gln Ser Leu Val Thr Ser Ser Arg Thr Ile Leu Asp Ala
160 165 170

AAT GAA GAA TTT AAG TCC ATG TCG GGC ACC ATC CAG CTG GGC CGG AAG 577
Asn Glu Glu Phe Lys Ser Met Ser Gly Thr Ile Gln Leu Gly Arg Lys
175 180 185

CTT ATC ACA AAA TAC AAT CGC CGG GAG CTG ACG GAC AAG CTT CTC ATC 625 
Leu Ile Thr Lys Tyr Asn Arg Arg Glu Leu Thr Asp Lys Leu Leu Ile
190 195 200 205

TTC CTT GCG CTA CGC CTG TTT CTT GCT ACG GTC CTC TAT ATT GTG AAA 673
Phe Leu Ala Leu Arg Leu Phe Leu Ala Thr Val Leu Tyr Ile Val Lys
210 215 220

AAG CGG CTC TTT CCA TTT TTG TGAGATCCCA AAGGTGCCAG TTCTGGCCCT 724
Lys Arg Leu Phe Pro Phe Leu
225

TTCAGCTCCT GTTTCAGGAT CTGTCCTGGT TCCTGAGCTC TAGGCTGCTA AGCTGAGCCA 784

CACACCCCTC CGTTTTGCAC CAGTTGCCTG CAGGTTGGAT GGAACACAGT GCCCCACTTT 844 

TCTGCAAGTA GCTGGCTTGT AAAGGGTGAA CAGAGCCATG GGAGGAAGGT CTGGCATTGG 904 

GATGCCGCCC TGGGGACATA CGAACCGCCT CCTTCCACCA TTGTGCACTA TGGGAGGCCG 964 

CTGCTGCGTG GAGCACTTAA AGTCCAGCCT CCAGGACCGG ATGCCCCTCC TGTCTCCCGC 1024 

TCCCATCGTG CCCTTAAATG CCAGATCTGG TGGAGGGAAG AGAGAAGAGG TAGGAAGAAA 1084 

GGTGATGAAA ACTCCTG 1101 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 amino acids 
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Met Ala Ala Pro Gln Asp Val His Val Arg Ile Cys Asn Gln Glu Ile
1 5 10 15

Val Lys Phe Asp Leu Glu Val Lys Ala Leu Ile Gln Asp Ile Arg Asp
20 25 30

Cys Ser Gly Pro Leu Ser Ala Leu Thr Glu Leu Asn Thr Lys Val Lys
35 40 45

Glu Lys Phe Gln Gln Leu Arg His Arg Ile Gln Asp Leu Glu Gln Leu
50 55 60

Ala Lys Glu Gln Asp Lys Glu Ser Glu Lys Gln Leu Leu Leu Gln Glu
65 70 75 80

Val Glu Asn His Lys Lys Gln Met Leu Ser Asn Gln Ala Ser Trp Arg
85 90 95

Lys Ala Asn Leu Thr Cys Lys Ile Ala Ile Asp Asn Leu Glu Lys Ala
100 105 110

Glu Leu Leu Gln Gly Gly Asp Leu Leu Arg Gln Arg Lys Thr Thr Lys
115 120 125

Glu Ser Leu Ala Gln Thr Ser Ser Thr Ile Thr Glu Ser Leu Met Gly
130 135 140

Ile Ser Arg Met Met Ala Gln Gln Val Gln Gln Ser Glu Glu Ala Met
145 150 155 160

Gln Ser Leu Val Thr Ser Ser Arg Thr Ile Leu Asp Ala Asn Glu Glu
165 170 175

Phe Lys Ser Met Ser Gly Thr Ile Gln Leu Gly Arg Lys Leu Ile Thr
180 185 190

Lys Tyr Asn Arg Arg Glu Leu Thr Asp Lys Leu Leu Ile Phe Leu Ala
195 200 205

Leu Arg Leu Phe Leu Ala Thr Val Leu Tyr Ile Val Lys Lys Arg Leu
210 215 220

Phe Pro Phe Leu 
225 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1386 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 214..1155

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CTGCGGCCGG GGGATTGGGC CGGGGTCTCC ACCGCCGACC GAGGGGAGCG GCGTCCGCTC 60

GGCCCTGCTT TTTGCGACCT GCCGTCAGCC CCACGTCGCC GGCCTGGAGG GGCOAAGAGG 120

ACGAGGGGCG ACGAAGGCCC AAGGCTTCCT CCGGGGACAT TGGCTCCCTG GATTATCAAG 180

CAGTTTGTAG TTGACATTGA ATCCAGGCTG AGG ATG GAA GGT GTG GAA CTT AAA 234

Met Glu Gly Val Glu Leu Lys 

230 235 

GAA GAA TGG CAA GAT GAA GAT TTT CCG ATA CCT TTA CCA GAA GAT GAT 282 
Glu Glu Trp Gln Asp Glu Asp Phe Pro Ile Pro Leu Pro Glu Asp Asp
240 245 250

AGT ATT GAA GCA GAT ATA CTA GCT ATA ACT GGA CCA GAG GAC CAG CCT 330
Ser Ile Glu Ala Asp Ile Leu Ala Ile Thr Gly Pro Glu Asp Gln Pro
255 260 265

GGC TCA CTA GAA GTT AAT GGA AAT AAA GTG AGA AAG AAA CTA ATG GCT 378
Gly Ser Leu Glu Val Asn Gly Asn Lys Val Arg Lys Lys Leu Met Ala
270 275 280

CCA GAC ATT AGC CTG ACA CTG GAT CCT AGT GAT GGC TCT GTA TTG TCA 426
Pro Asp Ile Ser Leu Thr Leu Asp Pro Ser Asp Gly Ser Val Leu Ser
285 290 295

GAT GAT TTG GAT GAA AGT GGG GAG ATT GAC TTA GAT GGC TTA GAC ACA 474
Asp Asp Leu Asp Glu Ser Gly Glu Ile Asp Leu Asp Gly Leu Asp Thr
300 305 310 315

CCG TCA GAG AAT AGT AAT GAG TTT GAG TGG GAA GAT GAT CTT CCA AAA 522 
Pro Ser Glu Asn Ser Asn Glu Phe Glu Trp Glu Asp Asp Leu Pro Lys 
320 325 330 





CCC AAG ACT ACT GAA GTA ATT AGG AAA GGC TCA ATT ACT GAA TAC ACA 570 
Pro Lys Thr Thr Glu Val Ile Arg Lys Gly Ser Ile Thr Glu Tyr Thr
335 340 345

GCA GCA GAG GAA AAA GAA GAT GGA CGA CGC TGG CGT ATG TTC AGG ATT 618
Ala Ala Glu Glu Lys Glu Asp Gly Arg Arg Trp Arg Met Phe Arg Ile
350 355 360

GGA GAA CAG GAC CAC AGG GTT GAT ATG AAG GCA ATT GAA CCC TAT AAA 666
Gly Glu Gln Asp His Arg Val Asp Met Lys Ala Ile Glu Pro Tyr Lys
365 370 375

AAA GTT ATC AGC CAT GGG GGA TAT TAT GGG GAT GGA TTA AAT GCC ATT 714
Lys Val Ile Ser His Gly Gly Tyr Tyr Gly Asp Gly Leu Asn Ala Ile
380 385 390 395



GTT GTA TTT GCT GTC TGT TTC ATG CCT GAA AGT AGT CAG CCT AAC TAT 762 
Val Val Phe Ala Val Cys Phe Met Pro Glu Ser Ser Gln Pro Asn Tyr
400 405 410

AGA TAC CTG ATG GAC AAT CTT TTT AAA TAT GTT ATT GGC ACT TTG GAG 810
Arg Tyr Leu Met Asp Asn Leu Phe Lys Tyr Val Ile Gly Thr Leu Glu
415 420 425

CTA TTA GTA GCA GAA AAC TAC ATG ATA GTT TAT TTA AAT GGT GCA ACA 858
Leu Leu Val Ala Glu Asn Tyr Met Ile Val Tyr Leu Asn Gly Ala Thr
430 435 440

ACT CGA AGA AAA ATG CCC AGT CTG GGA TGG CTC AGG AAA TGT TAT CAG 906
Thr Arg Arg Lys Met Pro Ser Leu Gly Trp Leu Arg Lys Cys Tyr Gln
445 450 455

CAA ATT GAT AGA AGG TTA CGG AAA AAT CTA AAA TCC CTA ATC ATT GTA 954
Gln Ile Asp Arg Arg Leu Arg Lys Asn Leu Lys Ser Leu Ile Ile Val
460 465 470 475

CAT CCT TCT TGG TTT ATC AGA ACA CTT CTG GCT GTT ACA AGA CCA TTT 1002
His Pro Ser Trp Phe Ile Arg Thr Leu Leu Ala Val Thr Arg Pro Phe
480 485 490

ATT AGC TCG AAA TTC AGC CAA AAA ATT AGA TAC GTG TTT AAT TTG GCA 1050
Ile Ser Ser Lys Phe Ser Gln Lys Ile Arg Tyr Val Phe Asn Leu Ala
495 500 505

GAA CTA GCA GAA CTT GTC CCC ATG GAA TAC GTT GGC ATA CCA GAA TGC 1098
Glu Leu Ala Glu Leu Val Pro Met Glu Tyr Val Gly Ile Pro Glu Cys
510 515 520

ATA AAA CAA GTT GAT CAA GAA CTT AAT GGA AAA CAA GAT GAA CCG AAA 1146
Ile Lys Gln Val Asp Gln Glu Leu Asn Gly Lys Gln Asp Glu Pro Lys
525 530 535

AAT GAA CAG TAAGTTTGGC ATCTAGTCCA AACAAGACTG AAGAATGTGC 1195
Asn Glu Gln
540

TGATGGAGCA GTGCTGTTTC TGCATTCATA ATGCATTTAT TGGCCATATT TTTATGTAAC 1255
CTGTTACAAA ATAGACTTGA CTTTTTCATA ATGGACTTTT GTATTATACA AGGGACTGTT 1315
CACTGCTGTA CTGGTTTGCA AATTTCTTGA ATTTAGCTCT TTATGCTAAC TGTATTATTA 1375
TCATTTTATA T 1386

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Glu Gly Val Glu Leu Lys Glu Glu Trp Gln Asp Glu Asp Phe Pro
1 5 10 15

Ile Pro Leu Pro Glu Asp Asp Ser Ile Glu Ala Asp Ile Leu Ala Ile
20 25 30

Thr Gly Pro Glu Asp Gln Pro Gly Ser Leu Glu Val Asn Gly Asn Lys
35 40 45

Val Arg Lys Lys Leu Met Ala Pro Asp Ile Ser Leu Thr Leu Asp Pro
50 55 60

Ser Asp Gly Ser Val Leu Ser Asp Asp Leu Asp Glu Ser Gly Glu Ile
65 70 75 80

Asp Leu Asp Gly Leu Asp Thr Pro Ser Glu Asn Ser Asn Glu Phe Glu
85 90 95

Trp Glu Asp Asp Leu Pro Lys Pro Lys Thr Thr Glu Val Ile Arg Lys
100 105 110

Gly Ser Ile Thr Glu Tyr Thr Ala Ala Glu Glu Lys Glu Asp Gly Arg
115 120 125

Arg Trp Arg Met Phe Arg Ile Gly Glu Gln Asp His Arg Val Asp Met
130 135 140

Lys Ala Ile Glu Pro Tyr Lys Lys Val Ile Ser His Gly Gly Tyr Tyr
145 150 155 160

Gly Asp Gly Leu Asn Ala Ile Val Val Phe Ala Val Cys Phe Met Pro
165 170 175





Glu Ser Ser Gln Pro Asn Tyr Arg Tyr Leu Met Asp Asn Leu Phe Lys
180 185 190

Tyr Val Ile Gly Thr Leu Glu Leu Leu Val Ala Glu Asn Tyr Met Ile
195 200 205

Val Tyr Leu Asn Gly Ala Thr Thr Arg Arg Lys Met Pro Ser Leu Gly
210 215 220

Trp Leu Arg Lys Cys Tyr Gln Gln Ile Asp Arg Arg Leu Arg Lys Asn
225 230 235 240

Leu Lys Ser Leu Ile Ile Val His Pro Ser Trp Phe Ile Arg Thr Leu
245 250 255

Leu Ala Val Thr Arg Pro Phe Ile Ser Ser Lys Phe Ser Gln Lys Ile
260 265 270

Arg Tyr Val Phe Asn Leu Ala Glu Leu Ala Glu Leu Val Pro Met Glu
275 280 285

Tyr Val Gly Ile Pro Glu Cys Ile Lys Gln Val Asp Gln Glu Leu Asn
290 295 300

Gly Lys Gln Asp Glu Pro Lys Asn Glu Gln
305 310

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 109..690

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

AAAGAGATCT GGAATTCGGA TCCTCGAGGC CACGAAGGCC ACCGCCCGCA GCTGAAGCAC 60 

ATCCGCAGCC CGGCGCGACT CCGATCGCCG CAGTTGCCCT CTGGCGCC ATG TCC GAG 117 

Met Ser Glu
315

AAC GGA GCG CCC GGG ATG GAG GAG GAG AGC CTG CAG GGC TCC TGG GTA 165
Asn Gly Ala Pro Gly Met Gln Glu Glu Ser Leu Gln Gly Ser Trp Val
320 325 330

GAA CTG CAC TTC AGC AAT AAT GGG AAC GGG GGC AGC GTT CCA GCC TCG 213
Glu Leu His Phe Ser Asn Asn Gly Asn Gly Gly Ser Val Pro Ala Ser
335 340 345

GTT TCT ATT TAT AAT GGA GAC ATG GAA AAA ATA CTG CTG GAC GCA CAG 261
Val Ser Ile Tyr Asn Gly Asp Met Glu Lys Ile Leu Leu Asp Ala Gln
350 355 360 365

CAT GAG TCT GGA CGG AGT AGC TCC AAG AGC TCT CAC TGT GAC AGC CCA 309
His Glu Ser Gly Arg Ser Ser Ser Lys Ser Ser His Cys Asp Ser Pro
370 375 380

CCT CGC TCG CAG ACA CCA CAA GAT ACC AAC AGG GCT TCT GAA ACA GAT 357
Pro Arg Ser Gln Thr Pro Gln Asp Thr Asn Arg Ala Ser Glu Thr Asp
385 390 395

ACC CAT AGC ATT GGA GAG AAA AAC AGC TCA CAG TCT GAG GAA GAT GAT 405
Thr His Ser Ile Gly Glu Lys Asn Ser Ser Gln Ser Glu Glu Asp Asp
400 405 410

ATT AGA AGG AAA GAA GTT GAA AGC ATC TTG AAG AAA AAC TCA GAT 453
Ile Glu Arg Arg Lys Glu Val Glu Ser Ile Leu Lys Lys Asn Ser Asp
415 420 425





TGG ATA TGG GAT TGG TCA AGT CGG CCG GAA AAT ATT CCC CCC AAG GAG 501 

Trp Ile Trp Asp Trp Ser Ser Arg Pro Glu Asn Ile Pro Pro Lys Glu
430 435 440 445

TTC CTC TTT AAA CAC CCG AAG CGC ACG GCC ACC CTC AGC ATG AGG AAC 549
Phe Leu Phe Lys His Pro Lys Arg Thr Ala Thr Leu Ser Met Arg Asn
450 455 460

ACG AGC GTC ATG AAG AAA GGG GGC ATA TTC TCT GCA GAA TTT CTG AAA 597
Thr Ser Val Met Lys Lys Gly Gly Ile Phe Ser Ala Glu Phe Leu Lys
465 470 475

GTT TTC CTT CCA TCT CTG CTG CTC TCT CAT TTG CTG GCC ATC GGA TTG 645
Val Phe Leu Pro Ser Leu Leu Leu Ser His Leu Leu Ala Ile Gly Leu
480 485 490

GGG ATC TAT ATT GGA AGG CGT CTG ACA ACC TCC ACC AGC ACC TTT 690
Gly Ile Tyr Ile Gly Arg Arg Leu Thr Thr Ser Thr Ser Thr Phe
495 500 505

TGATGAAGAA CTGGAGTCTG ACTTGGTTCG TTAGTGGATT ACTTCTGAGC TTGCAACATA 750

GCTCACTGAA GAGCTGTTAG ATCCTGGGCC TTCGTGGCTC GAGAGACTAG AATCGCAGAT 810 

ACGAAAACCC CGCAGC 826 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 194 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Ser Glu Asn Gly Ala Pro Gly Met Gln Glu Glu Ser Leu Gln Gly
1 5 10 15

Ser Trp Val Glu Leu His Phe Ser Asn Asn Gly Asn Gly Gly Ser Val
20 25 30

Pro Ala Ser Val Ser Ile Tyr Asn Gly Asp Met Glu Lys Ile Leu Leu
35 40 45

Asp Ala Gln His Glu Ser Gly Arg Ser Ser Ser Lys Ser Ser His Cys
50 55 60

Asp Ser Pro Pro Arg Ser Gln Thr Pro Gln Asp Thr Asn Arg Ala Ser
65 70 75 80

Glu Thr Asp Thr His Ser Ile Gly Glu Lys Asn Ser Ser Gln Ser Glu
85 90 95

Glu Asp Asp Ile Glu Arg Arg Lys Glu Val Glu Ser Ile Leu Lys Lys
100 105 110

Asn Ser Asp Trp Ile Trp Asp Trp Ser Ser Arg Pro Glu Asn Ile Pro
115 120 125

Pro Lys Glu Phe Leu Phe Lys His Pro Lys Arg Thr Ala Thr Leu Ser
130 135 140

Met Arg Asn Thr Ser Val Met Lys Lys Gly Gly Ile Phe Ser Ala Glu
145 150 155 160

Phe Leu Lys Val Phe Leu Pro Ser Leu Leu Leu Ser His Leu Leu Ala
165 170 175

Ile Gly Leu Gly Ile Tyr Ile Gly Arg Arg Leu Thr Thr Ser Thr Ser
180 185 190 

Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 26..505

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CAGCATCGCC GCCGCCAGAG GAGAA ATG TCT GAA GTA AGA CCC CTC TCC AGA 52 

Met Ser Glu Val Arg Pro Leu Ser Arg 
195 200 

GAC ATC TTG ATG GAG ACC CTC CTG TAT GAG CAG CTC CTG GAA CCC CCG 100
Asp Ile Leu Met Glu Thr Leu Leu Tyr Glu Gln Leu Leu Glu Pro Pro
205 210 215

ACC ATG GAG GTT CTT GGC ATG ACT GAC TCT GAA GAG GAC CTG GAC CCT 148
Thr Met Glu Val Leu Gly Met Thr Asp Ser Glu Glu Asp Leu Asp Pro
220 225 230 235

ATG GAG GAC TTC GAT TCT TTG GAA TGC ATG GAG GGC AGT GAC GCA TTG 196
Met Glu Asp Phe Asp Ser Leu Glu Cys Met Glu Gly Ser Asp Ala Leu
240 245 250

GCC CTG CGG CTG GCC TGC ATC GGG GAC GAG ATG GAC GTG AGC CTC AGG 244
Ala Leu Arg Leu Ala Cys Ile Gly Asp Glu Met Asp Val Ser Leu Arg
255 260 265

GCC CCG CGC CTG GCC CAG CTC TCC GAG GTG GCC ATG CAC AGC CTG GGT 292
Ala Pro Arg Leu Ala Gln Leu Ser Glu Val Ala Met His Ser Leu Gly
270 275 280

CTG GCT TTC ATC TAC GAC CAG ACT GAG GAC ATC AGG GAT GTT CTT AGA 340
Leu Ala Phe Ile Tyr Asp Gln Thr Glu Asp Ile Arg Asp Val Leu Arg
285 290 295

AGT TTC ATG GAC GGT TTC ACC ACA CTT AAG GAG AAC ATA ATG AGG TTC 388
Ser Phe Met Asp Gly Phe Thr Thr Leu Lys Glu Asn Ile Met Arg Phe
300 305 310 315

TGG AGA TCC CCG AAC CCC GGG TCC TGG GTG TCC TGC GAA CAG GTG CTG 436
Trp Arg Ser Pro Asn Pro Gly Ser Trp Val Ser Cys Glu Gln Val Leu
320 325 330

CTG GCG CTG CTG CTG CTG CTG GCG CTG CTG CTG CCG CTG CTC AGC GGG 484
Leu Ala Leu Leu Leu Leu Leu Ala Leu Leu Leu Pro Leu Leu Ser Gly
335 340 345

GGC CTG CAC CTG CTG CTC AAG TGAGGCCCCG GCGGCTCAGG GCGTGGCTGG 535
Gly Leu His Leu Leu Leu Lys 
350 

CCCCACCCCC ATGACCACTG CCCTGGAGGT GGCGGCCTGC TGCTGTTATC TTTTTAACTG 595 

TTTTCTCATG ATGCCTTTTT ATATTTAAAC CCCGAGATAG TGCTGGAACA CTGCTGAGGT 655 

TTTATACTCA GGTTTTTTGT TTTTTTTTTA TTCCAGTTTT CGTTTTTTCT AAAAGATGAA 715 

TTCCTATGGC TCTGCAATTG TCACCGGTTA ACTGTGGCCT GTGCCCAGGA AGAGCCATTC 775

ACTCCTGCCC CTGCCCACAC GGCAGGTAGC AGGGGGAGTG CTGGTCACAC CCCTGTGTGA 835 

TATGTGATGC CCTCGGCAAA GAATCTACTG GAATAGATTC CGAGGAGCAG GAGTGCTCAA 895 

TAAAATGTTG GTTTCCAGCA AAAAAAAAAA AAAGGCCTTC GTGGCCTCGA G 946 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Ser Glu Val Arg Pro Leu Ser Arg Asp Ile Leu Met Glu Thr Leu
1 5 10 15

Leu Tyr Glu Gln Leu Leu Glu Pro Pro Thr Met Glu Val Leu Gly Met
20 25 30

Thr Asp Ser Glu Glu Asp Leu Asp Pro Met Glu Asp Phe Asp Ser Leu
35 40 45

Glu Cys Met Glu Gly Ser Asp Ala Leu Ala Leu Arg Leu Ala Cys Ile
50 55 60

Gly Asp Glu Met Asp Val Ser Leu Arg Ala Pro Arg Leu Ala Gln Leu
65 70 75 80

Ser Glu Val Ala Met His Ser Leu Gly Leu Ala Phe Ile Tyr Asp Gln
85 90 95

Thr Glu Asp Ile Arg Asp Val Leu Arg Ser Phe Met Asp Gly Phe Thr
100 105 110

Thr Leu Lys Glu Asn Ile Met Arg Phe Trp Arg Ser Pro Asn Pro Gly
115 120 125

Ser Trp Val Ser Cys Glu Gln Val Leu Leu Ala Leu Leu Leu Leu Leu
130 135 140

Ala Leu Leu Leu Pro Leu Leu Ser Gly Gly Leu His Leu Leu Leu Lys
145 150 155 160



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..777 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATG GAC GCC TGG GTC CGC TTC AGT GCT CAG AGC CAA GCC CGG GAG CGG 48
Met Asp Ala Trp Val Arg Phe Ser Ala Gln Ser Gln Ala Arg Glu Arg
165 170 175

CTG TGT AGG GCC GCC CAG TAT GCT TGC TCT CTT CTT GGC CAT GCG CTG 96
Leu Cys Arg Ala Ala Gln Tyr Ala Cys Ser Leu Leu Gly His Ala Leu
180 185 190

CAG AGG CAT GGA GCC AGT CCT GAG TTA CAG AAA CAG ATT CGA CAA CTG 144
Gln Arg His Gly Ala Ser Pro Glu Leu Gln Lys Gln Ile Arg Gln Leu
195 200 205

GAG AGC CAC CTG AGC CTT GGA AGA AAG CTT CTA CGC CTG GGT AAC TCA 192
Glu Ser His Leu Ser Leu Gly Arg Lys Leu Leu Arg Leu Gly Asn Ser
210 215 220

GCA GAT GCC CTT GAG TCA GCC AAA AGA GCT GTT CAC CTA TCA GAT GTT 240
Ala Asp Ala Leu Glu Ser Ala Lys Arg Ala Val His Leu Ser Asp Val
225 230 235 240

GTC CTG AGA TTC TGC ATC ACT GTT AGT CAC CTC AAT CGA GCC TTG TAG 288
Val Leu Arg Phe Cys Ile Thr Val Ser His Leu Asn Arg Ala Leu Tyr
245 250 255 



TTC GCC TGT CAC AAT GTC CTG TGG GCT GGA AAG TCT GGA CTG GCT CCC 336 

Phe Ala Cys His Asn Val Leu Trp Ala Gly Lys Ser Gly Leu Ala Pro
260 265 270

CGT GTG GAT CAG GAG AAG TGG GCC CAG CGT TCA TTC AGG TAC TAT TTG 384
Arg Val Asp Gln Glu Lys Trp Ala Gln Arg Ser Phe Arg Tyr Tyr Leu
275 280 285

TTT TCC CTC ATC ATG AAT TTG AGC CGT GAT GCT TAT GAG ATT CGC CTA 432
Phe Ser Leu Ile Met Asn Leu Ser Arg Asp Ala Tyr Glu Ile Arg Leu
290 295 300

CTG ATG GAG CAA GAG TCT TCT GCT TGT AGC CGG CGA CTG AAA GGT TCT 480
Leu Met Glu Gln Glu Ser Ser Ala Cys Ser Arg Arg Leu Lys Gly Ser
305 310 315 320

GGA GGA GGA GTC CCA GGA GGA AGT GAA ACT GGG GGA CTT GGG GGA CCA 528
Gly Gly Gly Val Pro Gly Gly Ser Glu Thr Gly Gly Leu Gly Gly Pro
325 330 335

GGG ACT CCA GGA GGA GGT CTG CCC CAA CTG GCT CTG AAA CTT CGG CTG 576
Gly Thr Pro Gly Gly Gly Leu Pro Gln Leu Ala Leu Lys Leu Arg Leu
340 345 350

CAA GTC CTC CTC CTG GCT CGA GTC CTT AGA GGT CAT CCC CCA CTT CTG 624
Gln Val Leu Leu Leu Ala Arg Val Leu Arg Gly His Pro Pro Leu Leu
355 360 365

CTA GAC GTG GTC AGA AAT GCC TGT GAT CTC TTC ATT CCT CTG GAC AAA 672
Leu Asp Val Val Arg Asn Ala Cys Asp Leu Phe Ile Pro Leu Asp Lys
370 375 380

CTA GCG CTC TGG CGC TGT GGC CCT GGG ATT GTG GGG CTT TGT GGC CTC 720
Leu Ala Leu Trp Arg Cys Gly Pro Gly Ile Val Gly Leu Cys Gly Leu
385 390 395 400

GTG TCC TCC ATC CTG TCT ATT CTC ACC CTA ATC TAT CCC TGG CTA CGA 768
Val Ser Ser Ile Leu Ser Ile Leu Thr Leu Ile Tyr Pro Trp Leu Arg
405 410 415

CTC AAG CCC TGACTTCCGG TACAGGATAA GGAGGGGACC TGAATTGGTG 817
Leu Lys Pro

AGATGGAATC TTAGATCGTC CCCATGTGCC AGCCTCATTC GAATTCTACT CTTTGGTTAA 877

AGTTAGAAAT TCAGAGATTT AGGGGTGGAG GAGGAAGAGC TTTGGGGAAG ATGAGGTAAG 937
GAAAGATGAC TCGTGAAGTT AATAGGATGT CTCTAATTTC TAGA 981

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Asp Ala Trp Val Arg Phe Ser Ala Gln Ser Gln Ala Arg Glu Arg
1 5 10 15

Leu Cys Arg Ala Ala Gln Tyr Ala Cys Ser Leu Leu Gly His Ala Leu
20 25 30

Gln Arg His Gly Ala Ser Pro Glu Leu Gln Lys Gln Ile Arg Gln Leu
35 40 45

Glu Ser His Leu Ser Leu Gly Arg Lys Leu Leu Arg Leu Gly Asn Ser
50 55 60

Ala Asp Ala Leu Glu Ser Ala Lys Arg Ala Val His Leu Ser Asp Val
65 70 75 80

Val Leu Arg Phe Cys Ile Thr Val Ser His Leu Asn Arg Ala Leu Tyr
85 90 95

Phe Ala Cys His Asn Val Leu Trp Ala Gly Lys Ser Gly Leu Ala Pro
100 105 110

Arg Val Asp Gln Glu Lys Trp Ala Gln Arg Ser Phe Arg Tyr Tyr Leu
115 120 125

Phe Ser Leu Ile Met Asn Leu Ser Arg Asp Ala Tyr Glu Ile Arg Leu
130 135 140

Leu Met Glu Gln Glu Ser Ser Ala Cys Ser Arg Arg Leu Lys Gly Ser
145 150 155 160

Gly Gly Gly Val Pro Gly Gly Ser Glu Thr Gly Gly Leu Gly Gly Pro
165 170 175

Gly Thr Pro Gly Gly Gly Leu Pro Gln Leu Ala Leu Lys Leu Arg Leu
180 185 190

Gln Val Leu Leu Leu Ala Arg Val Leu Arg Gly His Pro Pro Leu Leu
195 200 205

Leu Asp Val Val Arg Asn Ala Cys Asp Leu Phe Ile Pro Leu Asp Lys
210 215 220

Leu Ala Leu Trp Arg Cys Gly Pro Gly Ile Val Gly Leu Cys Gly Leu
225 230 235 240

Val Ser Ser Ile Leu Ser Ile Leu Thr Leu Ile Tyr Pro Trp Leu Arg
245 250 255



Leu Lys Pro 



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2166 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1503

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

AGT CAA GTC GAG TTT CTT TGC ATG TCA CTG GCC AAT CCT CTT CCA ACA 48
Ser Gln Val Glu Phe Leu Cys Met Ser Leu Ala Asn Pro Leu Pro Thr
260 265 270 275 



46 



50 



AAT TAC ACQ TGG TAC CAC AAT GGG AAA GAA ATG CAG GGA AGG ACA GAG 96 
Asn Tyr Thr Trp Tyr His Asn Gly Lys Glu Met Gln Gly Arg Thr Glu
280 285 290 

GAG AAA GTC CAC ATC CCA AAG ATC CTC CCC TGG CAC GCT GGG ACT TAT 144 
Glu Lys Val His Ile Pro Lys Ile Leu Pro Trp His Ala Gly Thr Tyr
295 300 305 
TCC TGT GTG GCA GAA AAC ATT CTT GGT ACT GGA CAG AGG GGC CCG GGA 192
Ser Cys Val Ala Glu Asn Ile Leu Gly Thr Gly Gln Arg Gly Pro Gly
310 315 320 

GCT GAG CTG GAT GTC CAG TAT CCT CCC AAG AAG GTG ACC ACA GTG ATT 240
Ala Glu Leu Asp Val Gln Tyr Pro Pro Lys Lys Val Thr Thr Val Ile
325 330 335 

CAA AAC CCC ATG CCG ATT CGA GAA GGA GAC ACA GTG ACC CTT TCC TGT 288
Gln Asn Pro Met Pro Ile Arg Glu Gly Asp Thr Val Thr Leu Ser Cys
340 345 350 355 

AAC TAC AAT TCC AGT AAC CCC ACT GTT ACC CGG TAT GAA TGG AAA CCC 336 
Asn Tyr Asn Ser Ser Asn Pro Ser Val Thr Arg Tyr Glu Trp Lys Pro 
360 365 370 

CAT GGC GCC TGG GAG GAG CCA TCG CTT GGG GTG CTG AAG ATC CAA AAC 384
His Gly Ala Trp Glu Glu Pro Ser Leu Gly Val Leu Lys Ile Gln Asn
375 380 385 

GTT GGC TGG GAC AAC ACA ACC ATC GCC TGC GCA CGT TGT AAT AGT TGG 432
Val Gly Trp Asp Asn Thr Thr Ile Ala Cys Ala Arg Cys Asn Ser Trp
390 395 400 

TGC TCG TGG GCC TCC CCT GTC GCC CTG AAT GTC CAG TAT GCC CCC CGA 480
Cys Ser Trp Ala Ser Pro Val Ala Leu Asn Val Gln Tyr Ala Pro Arg
405 410 415 

GAC GTG AGG GTC CGG AAA ATC AAG CCC CTT TCC GAG ATT CAC TCT GGA 528
Asp Val Arg Val Arg Lys Ile Lys Pro Leu Ser Glu Ile His Ser Gly
420 425 430 435 

AAC TCG GTC AGC CTC CAA TGT GAC TTC TCA AGC AGC CAC CCC AAA GAA 576 
Asn Ser Val Ser Leu Gln Cys Asp Phe Ser Ser Ser His Pro Lys Glu
440 445 450 

GTC CAG TTC TTC TGG GAG AAA AAT GGC AGG CTT CTG GGG AAA GAA AGC 624 
Val Gln Phe Phe Trp Glu Lys Asn Gly Arg Leu Leu Gly Lys Glu Ser
455 460 465 

CAG CTG AAT TTT GAC TCC ATC TCC CCA GAA GAT GCT GGG AGT TAC AGC 672 
Gln Leu Asn Phe Asp Ser Ile Ser Pro Glu Asp Ala Gly Ser Tyr Ser
470 475 480

TGC TGG GTG AAC AAC TCC ATA GGA CAG ACA GCG TCC AAG GCC TGG ACA 720 
Cys Trp Val Asn Asn Ser Ile Gly Gln Thr Ala Ser Lys Ala Trp Thr
485 490 495 
CTT GAA GTG CTG TAT GCA CCC AGG AGG CTG CGT GTG TCC ATG AGC CCG 768
Leu Glu Val Leu Tyr Ala Pro Arg Arg Leu Arg Val Ser Met Ser Pro
500 505 510 515 

GGG GAC CAA GTG ATG GAG GGG AAG AGT GCA ACC CTG ACC TGT GAG AGT 816
Gly Asp Gln Val Met Glu Gly Lys Ser Ala Thr Leu Thr Cys Glu Ser
520 525 530
GAC GCC AAC CCT CCC GTC TCC CAC TAC ACC TGG TTT GAC TGG AAT AAC 864
Asp Ala Asn Pro Pro Val Ser His Tyr Thr Trp Phe Asp Trp Asn Asn
535 540 545 

CAA AGC CTC CCC CAC CAC AGC CAG AAG CTG AGA TTG GAG CCG GTG AAG 912 
Gln Ser Leu Pro His His Ser Gln Lys Leu Arg Leu Glu Pro Val Lys
550 555 560 

GTC CAG CAC TCG GGT GCC TAC TGG TGC CAG GGG ACC AAC AGT GTG GGC 960 
Val Gln His Ser Gly Ala Tyr Trp Cys Gln Gly Thr Asn Ser Val Gly
565 570 575 

AAG GGC CGT TCG CCT CTC AGC ACC CTT ACT GTC TAC TAT AGC CCG GAG 1008 
Lys Gly Arg Ser Pro Leu Ser Thr Leu Thr Val Tyr Tyr Ser Pro Glu 
580 585 590 595 

ACC ATC GGC AGG CGA GTG GCT GTG GGA CTC GGG TCC TGC CTC GCC ATC 1056
Thr Ile Gly Arg Arg Val Ala Val Gly Leu Gly Ser Cys Leu Ala Ile
600 605 610 

CTC ATC CTG GCA ATC TGT GGG CTC AAG CTC CAG CGA CGT TGG AAG AGG 1104 
Leu Ile Leu Ala Ile Cys Gly Leu Lys Leu Gln Arg Arg Trp Lys Arg
615 620 625

ACA CAG AGC CAG CAG GGG CTT CAG GAG AAT TCC AGC GGC CAG AGC TTC 1152
Thr Gln Ser Gln Gln Gly Leu Gln Glu Asn Ser Ser Gly Gln Ser Phe
630 635 640 






TTT GTG AGG AAT AAA AAG GTT AGA AGG GCC CCC CTC TCT GAA GGC CCC 1200 
Phe Val Arg Asn Lys Lys Val Arg Arg Ala Pro Leu Ser Glu Gly Pro 
645 650 655 



CAC TCC CTG GGA TGC TAC AAT CCA ATG ATG GAA GAT GGC ATT AGC TAC 1248
His Ser Leu Gly Cys Tyr Asn Pro Met Met Glu Asp Gly Ile Ser Tyr
660 665 670 675

ACC ACC CTG CGC TTT CCC GAG ATG AAC ATA CCA CGA ACT GGA GAT GCA 1296 
Thr Thr Leu Arg Phe Pro Glu Met Asn Ile Pro Arg Thr Gly Asp Ala
680 685 690 


GAG TCC TCA GAG ATG CAG AGA CCT CCC CGG ACC TGC GAT GAC ACG GTC 1344 
Glu Ser Ser Glu Met Gln Arg Pro Pro Arg Thr Cys Asp Asp Thr Val
695 700 705 

ACT TAT TCA GCA TTG CAC AAG CGC CAA GTG GGC GAC TAT GAG AAC GTC 1392
Thr Tyr Ser Ala Leu His Lys Arg Gln Val Gly Asp Tyr Glu Asn Val
710 715 720 



ATT CCA GAT TTT CCA GAA GAT GAG GGG ATT CAT TAC TCA GAG CTG ATC 1440 
Ile Pro Asp Phe Pro Glu Asp Glu Gly Ile His Tyr Ser Glu Leu Ile
725 730 735 
CAG TTT GGG GTC GGG GAG CGG CCT CAG GCA CAA GAA AAT GTG GAC TAT 1488
Gln Phe Gly Val Gly Glu Arg Pro Gln Ala Gln Glu Asn Val Asp Tyr
740 745 750 755 

GTG ATC CTC AAA CAT TGACACTGGA TGGGCTGCAG CAGAGGCACT GGGGGCAGCG 1543
Val Ile Leu Lys His
760 

GGGGCCAGGG AAGTCCCCGA GTTTCCCCAG ACACCGCCAC ATGGCTTCCT CCTGCGTGCA 1603 

TGTGCGCACA CACACACACA CACGCACACA CACACACACA CACTCACTGC GGAGAACCTT 1663 

GTGCCTGGCT CAGAGCCAGT CTTTTTGGTG AGGGTAACCC CAAACCTCCA AAACTCCTGC 1723 

CCCTGTTCTC TTCCACTCTC CTTGCTACCC AGAAATCATC TAAATACCTG CCCTGACATG 1783 

CACACCTCCC CTGCCCCACC AGCCCACTGG CCATCTCCAC CCGGAGCTGC TGTGTCCTCT 1843 

GGATCTGCTC GTCATTTTCC TTCCCTTCTC CATCTCTCTG GCCCTCTACC CCTGATCTGA 1903 

CATCCCCACT CACGAATATT ATGCCCAGTT TCTGCCTCTG AGGGAAAGCC CAGAAAAGGA 1963 

CAGAAACGAA GTAGAAAGGG GCCCAGTCCT GGCCTGGCTT CTCCTTTGGA AGTGAGGCAT 2023 

TGCACGGGGA GACGTACGTA TCAGCGGCCC CTTGACTCTG GGGACTCCGG GTTTGAGATG 2083

GACACACTGG TGTGGATTAA CCTGCCAGGG AGACAGAGCT CACAATAAAA ATGGCTCAGA 2143 
TGCCACTTCA AAGAAAAAAA AAA 2166

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ser Gln Val Glu Phe Leu Cys Met Ser Leu Ala Asn Pro Leu Pro Thr
1 5 10 15

Asn Tyr Thr Trp Tyr His Asn Gly Lys Glu Met Gln Gly Arg Thr Glu
20 25 30

Glu Lys Val His Ile Pro Lys Ile Leu Pro Trp His Ala Gly Thr Tyr
35 40 45

Ser Cys Val Ala Glu Asn Ile Leu Gly Thr Gly Gln Arg Gly Pro Gly
50 55 60

Ala Glu Leu Asp Val Gln Tyr Pro Pro Lys Lys Val Thr Thr Val Ile
65 70 75 80

Gln Asn Pro Met Pro Ile Arg Glu Gly Asp Thr Val Thr Leu Ser Cys
85 90 95

Asn Tyr Asn Ser Ser Asn Pro Ser Val Thr Arg Tyr Glu Trp Lys Pro
100 105 110

His Gly Ala Trp Glu Glu Pro Ser Leu Gly Val Leu Lys Ile Gln Asn
115 120 125

Val Gly Trp Asp Asn Thr Thr Ile Ala Cys Ala Arg Cys Asn Ser Trp
130 135 140

Cys Ser Trp Ala Ser Pro Val Ala Leu Asn Val Gln Tyr Ala Pro Arg
145 150 155 160

Asp Val Arg Val Arg Lys Ile Lys Pro Leu Ser Glu Ile His Ser Gly
165 170 175

Asn Ser Val Ser Leu Gln Cys Asp Phe Ser Ser Ser His Pro Lys Glu
180 185 190

Val Gln Phe Phe Trp Glu Lys Asn Gly Arg Leu Leu Gly Lys Glu Ser
195 200 205

Gln Leu Asn Phe Asp Ser Ile Ser Pro Glu Asp Ala Gly Ser Tyr Ser
210 215 220

Cys Trp Val Asn Asn Ser Ile Gly Gln Thr Ala Ser Lys Ala Trp Thr
225 230 235 240

Leu Glu Val Leu Tyr Ala Pro Arg Arg Leu Arg Val Ser Met Ser Pro
245 250 255

Gly Asp Gln Val Met Glu Gly Lys Ser Ala Thr Leu Thr Cys Glu Ser
260 265 270

Asp Ala Asn Pro Pro Val Ser His Tyr Thr Trp Phe Asp Trp Asn Asn
275 280 285

Gln Ser Leu Pro His His Ser Gln Lys Leu Arg Leu Glu Pro Val Lys
290 295 300

Val Gln His Ser Gly Ala Tyr Trp Cys Gln Gly Thr Asn Ser Val Gly
305 310 315 320

Lys Gly Arg Ser Pro Leu Ser Thr Leu Thr Val Tyr Tyr Ser Pro Glu
325 330 335

Thr Ile Gly Arg Arg Val Ala Val Gly Leu Gly Ser Cys Leu Ala Ile
340 345 350

Leu Ile Leu Ala Ile Cys Gly Leu Lys Leu Gln Arg Arg Trp Lys Arg
355 360 365

Thr Gln Ser Gln Gln Gly Leu Gln Glu Asn Ser Ser Gly Gln Ser Phe
370 375 380

Phe Val Arg Asn Lys Lys Val Arg Arg Ala Pro Leu Ser Glu Gly Pro
385 390 395 400

His Ser Leu Gly Cys Tyr Asn Pro Met Met Glu Asp Gly Ile Ser Tyr
405 410 415

Thr Thr Leu Arg Phe Pro Glu Met Asn Ile Pro Arg Thr Gly Asp Ala
420 425 430

Glu Ser Ser Glu Met Gln Arg Pro Pro Arg Thr Cys Asp Asp Thr Val
435 440 445

Thr Tyr Ser Ala Leu His Lys Arg Gln Val Gly Asp Tyr Glu Asn Val
450 455 460

Ile Pro Asp Phe Pro Glu Asp Glu Gly Ile His Tyr Ser Glu Leu Ile
465 470 475 480

Gln Phe Gly Val Gly Glu Arg Pro Gln Ala Gln Glu Asn Val Asp Tyr
485 490 495

Val Ile Leu Lys His
500 
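
The relationship between the coding region of SEQ ID NO: 30 (CDS at 1..1503) and the 501-residue protein of SEQ ID NO: 31 can be checked with a short translation sketch. The following is an illustrative Python example only, assuming the standard genetic code; the names `translate`, `FIRST_CODONS` and `LAST_CODONS` are hypothetical helpers introduced here and are not part of the listing. It translates the first sixteen codons and the last five codons (plus the following three bases) as printed in SEQ ID NO: 30.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# ('*' marks a stop codon). Assumption: no alternative code applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nucleotides: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter residues."""
    seq = "".join(nucleotides.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 48 bases of SEQ ID NO: 30 as printed in the listing.
FIRST_CODONS = "AGT CAA GTC GAG TTT CTT TGC ATG TCA CTG GCC AAT CCT CTT CCA ACA"
# Bases 1489..1506: the last five codons of the CDS and the next three bases.
LAST_CODONS = "GTG ATC CTC AAA CAT TGA"

CDS_START, CDS_END = 1, 1503
codon_count = (CDS_END - CDS_START + 1) // 3

print(translate(FIRST_CODONS))  # SQVEFLCMSLANPLPT
print(translate(LAST_CODONS))   # VILKH*
print(codon_count)              # 501
```

Under these assumptions the first sixteen residues read Ser Gln Val Glu Phe Leu Cys Met Ser Leu Ala Asn Pro Leu Pro Thr, the CDS ends with Val Ile Leu Lys His followed by a TGA stop codon, and 1503 bases correspond to the 501 amino acids stated for SEQ ID NO: 31.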

(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Tyr Arg Gly Asp Arg Ala Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
Pro Glu Ser Thr 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Leu Ala Thr Asp Met Ser Cys His Phe Gln Gln Val Lys Thr Met Lys
1 5 10 15

Thr Ala Leu Gln Gln Leu Glu Arg Ile Asp Lys Ser Lys Ala Leu Ser
20 25 30

Leu Leu Leu His Ala Ala Asp Ile Ser His Pro Thr Lys Gln Trp Ser
35 40 45

Trp His Ser Arg Trp Thr Lys Ala Leu Met
50 55

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Ile Val Phe Ser Ala Cys Arg Met Pro Pro Ser His Gln Leu Asp His
1 5 10 15

Ser Lys Leu Leu Gly Tyr Leu Lys His Thr Leu Asp Gln Tyr Val Glu
20 25 30

Ser Asp Tyr Thr Leu Leu Tyr Leu His His Gly Leu Thr Ser Asp Asn
35 40 45

Lys Pro Ser Leu Ser Trp Leu Arg Asp Ala Tyr Arg Glu Phe Asp Arg
50 55 60

Lys Tyr Lys Lys Asn Ile Lys Ala Leu Tyr Ile Val His Pro Thr Met
65 70 75 80

Phe Ile Lys Thr Leu Leu Ile Leu Phe Lys Pro Leu Ile Ser Phe Lys
85 90 95

Phe Gly Gln Lys Ile Phe Tyr Val Asn Tyr Leu Ser Glu Leu Ser Glu
100 105 110

His Val Lys Leu Glu Gln Leu Gly Ile Pro Arg Gln Val
115 120 125



Claims 

1. A nucleic acid molecule that encodes a polypeptide which binds to a site of a molecule which regulates cell survival,
wherein said nucleic acid molecule comprises:

(a) the nucleic acid sequence shown in SEQ ID NO: 20; 

(b) the coding region of the nucleic acid sequence shown in SEQ ID NO: 20; 

(c) a nucleic acid sequence which encodes the amino acid sequence shown in SEQ ID NO: 21;

(d) the nucleic acid sequence shown in SEQ ID NO: 22; 

(e) the coding region of the nucleic acid sequence shown in SEQ ID NO: 22; 

(f) a nucleic acid sequence which encodes the amino acid sequence shown in SEQ ID NO: 23; 

(g) the nucleic acid sequence shown in SEQ ID NO: 24; 

(h) the coding region of the nucleic acid sequence shown in SEQ ID NO: 24; 

(i) a nucleic acid sequence which encodes the amino acid sequence shown in SEQ ID NO: 25; 
(j) the nucleic acid sequence shown in SEQ ID NO: 26; 

(k) the coding region of the nucleic acid sequence shown in SEQ ID NO: 26; 

(l) a nucleic acid sequence which encodes the amino acid sequence shown in SEQ ID NO: 27;

(m) the nucleic acid sequence shown in SEQ ID NO: 28; 

(n) the coding region of the nucleic acid sequence shown in SEQ ID NO: 28; 

(o) a nucleic acid sequence which encodes the amino acid sequence shown in SEQ ID NO: 29; 

(p) the nucleic acid sequence shown in SEQ ID NO: 30; 

(q) the coding region of the nucleic acid sequence shown in SEQ ID NO: 30; 

(r) a nucleic acid sequence which encodes the amino acid sequence shown in SEQ ID NO: 31; or 

(s) a nucleic acid sequence which hybridizes to any one of (a) to (r). 

2. The nucleic acid molecule of claim 1, wherein said site is fifteen amino acids or less in size.

3. The nucleic acid molecule of claim 1, wherein said site is ten amino acids or less in size.

4. The nucleic acid molecule of any one of claims 1 to 3, wherein said site is SEQ ID NO:1, SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID NO:32.

5. The nucleic acid molecule of any one of claims 1 to 4 which is a DNA molecule. 

6. A vector comprising the DNA molecule of claim 5.

7. The vector of claim 6 wherein said DNA molecule is operably linked to a promoter. 

8. A host cell transformed with the vector of claim 6 or 7.

9. A method of producing a polypeptide which binds to a site of a molecule which regulates cell survival, comprising: 

(a) culturing the host cell of claim 8; and

(b) recovering the polypeptide expressed by the host cell. 

10. A polypeptide which binds to a site of a molecule which regulates cell survival produced by the method of claim 9
or encoded by the nucleic acid molecule of any one of claims 1 to 5.

11. The polypeptide of claim 10, having the amino acid sequence of SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25,
SEQ ID NO:27, SEQ ID NO:29 or SEQ ID NO:31.

12. A nucleic acid molecule that encodes the site of a molecule which regulates cell survival to which the polypeptide 
of claim 10 or 11 binds. 

13. A pharmaceutical composition comprising the nucleic acid molecule of any one of claims 1 to 5 or 12; the polypeptide
of claim 10 or 11; or an oligopeptide encoded by the nucleic acid molecule of claim 12; and, optionally, a
pharmaceutically acceptable carrier.

14. A diagnostic composition comprising the nucleic acid molecule of any one of claims 1 to 5 or 12; the polypeptide of
claim 10 or 11; or an oligopeptide encoded by the nucleic acid molecule of claim 12.

15. Use of the pharmaceutical composition of claim 13 for the preparation of a medicament for regulating cell death, 
neutralizing activity of a molecule which regulates cell survival, treating or ameliorating immune disorders, or treat- 
ing or ameliorating abnormal states characterized by aberrant cell growth. 

16. A method for regulating cell death in vitro comprising exposing said cell to the polypeptide of claim 10 or 11.

17. A method for neutralizing activity of a molecule which regulates cell survival in vitro comprising exposing a cell to
the polypeptide of claim 10 or 11.

18. The method of claim 17, wherein said molecule which regulates cell survival is Bcl-2. 

19. The method of any one of claims 16 to 18, wherein said cell is exposed to said polypeptide by transfecting said cell
with the nucleic acid molecule of any one of claims 1 to 5.

20. A method for identifying a molecule which regulates cell survival, comprising: 

(a) lysing cells suspected of comprising a molecule which regulates cell survival to produce a lysate;

(b) exposing said lysate to the polypeptide of claim 10 or 11; and

(c) determining the presence of molecule-polypeptide aggregates.